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Introduction 


This companion book contains the solutions of the tutorial exercises which are found at the 
end of each chapter. Additional materials (datasets, codes, figures and slides) concerning 
the Handbook of Risk Management are available at the following internet web page: 
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Part I 


Risk Management in the 
Financial Sector 


Chapter 2 


Market Risk 


2.4.1 Calculating regulatory capital with the Basel I standardized mea- 


1. 


surement method 


(a) In the maturity approach, long and short positions are slotted into a maturity- 


based ladder comprising fifteen time-bands. The time bands are defined by dis- 
joint intervals |M-, M*]. The risk weights depend on the time band t and the 
value of the coupon!: 


K (t) | 0.00% 0.2096 0.40% 0.7096 1.2596 1.75% 2.2596 
Msc 1M 3M 6M 1Y 2Y 3Y 4Y 
Mg. 1M 3M 6M 1Y 19Y 2.8Y  3.6Y 
K (t) | 2.75% 3.2596 3.7596 4.5096 5.25% 6.00% 8.0096 
Msc 5Y TY 10Y 15Y 20Y +00 

Mga | 43Y  5.7Y | 7T.3Y | 9.3Y 106Y 12Y 20Y 


These risk weights apply to the net exposure on each time band. For reflecting 
basis and gap risks, the bank must also include a 10% capital charge to the 
smallest exposure of the matched positions. This adjustment is called the ‘vertical 
disallowance’. The Basel Committee considers a second adjustment for horizontal 
offsetting (the *horizontal disallowance'). For that, it defines 3 zones (less than 
1 year, one year to four years and more than four years). The offsetting can 
be done within and between the zones. The adjustment coefficients are 30% 
within the zones 2 and 3, 4096 within the zone 1, between the zones 1 and 
2, and between the zones 2 and 3, and 100% between the zones 1 and 3. To 
compute mathematically the required capital, we note £* (t) and S* (t) the long 
and short nominal positions for the time band t. t — 1 corresponds to the first 
time band [0, 1M], t = 2 corresponds to the second time band ]1M, 3M], etc. The 
risk weighted positions for the time band t are defined as £ (t) = K (t) x £* (t) 
and S (t) = K (t) x S* (t). The required capital for the overall net open position 
is then equal to: 


The matched position M (t) for the time band t is equal to min (£ (t) , S (t)). We 
deduce that the additional capital for the vertical disallowance is: 


13 
KYP = 10% x V M (t) 


t=1 


1 Coupons 3% or more are called big coupons (or BC) and coupons less than 3% are called small coupons 
(SC). When the maturity is greater than 20Y, K (t) is equal to 6.00% for big coupons and 12.50% for small 


coupons. 


— 
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N (t) = £ (t) — S (t) is the net exposure for the time band t. We then define the 
net long and net short exposures for the three zones as follows: 


£j - Y max (N (t) ,0) 
S; = - M min(N (t),0) 


where A; = [0,1Y], A2 = J1Y,4Y] and Ag = JAY, +00]. We define CF;,; as 
the exposure of the zone i that can be carried forward to the zone j. We then 
compute the additional capital for the horizontal disallowance: 


KHDP = 0.4 xmin(£,81) + 0.3 x min (£2, S2) + 0.3 x min (L3, S3) + 
0.4 x CF12 +0.4 x CF 23 + CFi3 


The regulatory capital for the general market risk is the sum of the three com- 
ponents: 
K = ROP + YP KBP 


For each time band, we report the long, short, matched and net exposures: 


Time band | £*(t) S*(t) KA | L(t) SHA M(t) Nit) 
3M-6M 100 50 0.40% | 0.40 0.20 0.20 0.20 
7TY-10Y 10 50 3.7596 | 0.45 2.25 0.45 —1.80 


'The capital charge for the overall open position is: 


KOP 


[0.40 + 0.45 — 0.20 — 2.25] 
1.6 


whereas the capital for the vertical disallowance is: 


KYP = 10% x (0.20 + 0.45) 
= 0.065 


We now compute the net long and net short exposures for the three zones: 


zone 1 2 3 
Li | 0.20 0.00 0.00 
Si | 0.00 0.00 1.80 


It follows that there is no horizontal offsetting within the zones. Moreover, we 
notice that we can only carry forward the long exposure £ to the zone 3 meaning 
that: 


KEP = 40% x 0.00 + 3096 x 0.00 + 3096 x 0.00 + 
40% x 0.00 + 40% x 0.00 + 10096 x 0.20 
= 0.20 


We finally deduce that the required capital is: 


K = 1.6+0.065 + 0.20 
$1.865 mn 
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2. (a) We have: 


Stock 3M Exxon IBM Pfizer AT&T Cisco Oracle 
Li 100 100 10 50 60 90 


Si 50 80 
Ni 100 50 10 50 60 90 —80 
We deduce that the capital charge for the specific risk is equal to $35.20 mn: 
7 
KSpecific — 8% x X` [A] = 8% x 440 = 35.20 
i=1 


(b) The total net exposure SaN, is equal to $280 mn, meaning that the capital 
charge for the general market risk is equal to $22.40 mn: 


General — 8% x 280 = 22.40 


(c) To hedge the market risk of the portfolio, the investor can sell $280 mn of S&P 
500 futures contracts”. In this case, the capital charge for the general market risk 
is equal to zero. However, this new exposure implies an additional capital charge 
for the specific risk: 


yc8pecific — 35.20 + 4% x 280 = 35.20 + 11.20 = 46.40 
Let S be the short exposure on S&P 500 futures contracts. We have: 


K = Kc8pecific HE K General 
(35.20 + 496 x S) +8% x |280 — S| 


We notice that there is a trade-off between the capital charge for the specific 
risk which is an increasing function of S and the capital charge for the general 
market risk which is a decreasing function of S for S < 280. Another expression 
of K (total) jg; 
e vm if S < 280 
~ | 12.80+12% x S otherwise 


We verify that the minimum is reached when S is exactly equal to 280 (see Figure 
2.1). 


3. (a) Under SMM, we have: 
KSMM = 8% x Nw 


(b) The 10-day Gaussian value-at-risk is equal to: 


Nw x o (w) 
VaRo9% (w; ten days) = 2.33 x — —2——— x v10 
99% ( y ) V260 


= 0457 x Ny x o (w) 
We deduce that the required capital is approximately equal to: 


KIMA gag (3-4 £) x VaRosy (w; ten days) 
(3 + £) x 0.457 x Nw x o (w) 


Because € < 1, it follows that: 


KEM < 1.828 x Ny x o (w) 


?We assume that the beta of the portfolio with respect to the S&P 500 index is equal to one. 


(c) 


(d) 


P sme 
eo 
Rae 
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40 80 120 160 200 240 280 320 560 400 


FIGURE 2.1: Capital charge K with respect to S 


A sufficient condition for KIMM < KSMM ig: 


1.828 x Nw x o (w) € 8% x Nw 
8% 


The annualized volatility of the portfolio must be lower than 4.37%. This im- 
plies that long equity exposures induce more required capital under IMM than 
under SMM. Indeed, the volatility of directional equity portfolios is generally 
higher than 12%. In order to obtain equity portfolios with such lower volatil- 
ity, the portfolio must be long/short, meaning that the directional risk must be 
(partially) hedged. 


The bank is exposed to foreign exchange and commodity risks with spot and 
forward positions. Contrary to stocks or many equity products, these exposures 
include a maturity pattern. For instance, the $100 mn EUR long position has 
not the same maturity than the $100 mn EUR short position, implying that the 
bank cannot match the two positions. 


We first consider the FX risk. We have Veur = 100—100 = 0, Mpy = 50—100 = 
—50, Ncap = 0— 50 = —50 and Neola = 50 — 0 = 50. We deduce that the 
aggregated long and short positions are Zpx = 0 and Spx = 100. It follows that 
the required capital is: 


KEX 


II 


8% x (max (Lex, 8rx) + Waoal) 
= 8% x (100 + 50) 
$12 mn 


— 


— 
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For the commodity risk, we exclude the Gold position, because it is treated as a 
foreign exchange risk. We have: 


7 7 
Commodity — 15% x X` 16; — Si] +3% x M (Li + Si) 
i=5 i= 
= 15% x (50 + 20 + 20) + 3% x 390 
13.50 + 11.70 


$25.20 mn 


II 


We finally obtain: 


K = KEX d K. Commodity 
$33.20 mn 


Under the maturity ladder approach, the bank should spread long and short ex- 
posures of each currency to seven time bands: 0-1M, 1M-3M, 3M-6M, 6M-1Y, 
1Y-2Y, 2Y-3Y, 3Y+. For each time band, the capital charge for the basis risk 
is equal to 1.596 of the matched positions (long and short). Nevertheless, the 
residual net position of previous time bands may be carried forward to offset 
exposures in next time bands. In this case, a surcharge of 0.696 of the residual 
net position is added at each time band to cover the time spread risk. Finally, 
a capital charge of 1596 is applied to the global net exposure (or the resid- 
ual unmatched position) for directional risk. To compute mathematically the 
required capital, we note £; (t) and S; (t) the long and short positions of the 
commodity 7 for the time band t. t = 1 corresponds to the first time band 
(0, 1M] and t = 7 corresponds to the last time band ]3Y, --oo[. The cumulative 
long and short exposures are £f (t) = £f (t — 1) + £i (t) with £7 (0) = 0 and 
S; (t) = S$ (t — 1) +S; (t) with S;* (0) = 0. The cumulative matched position is 
MIT (t) = min (£f (t) ,S;* (t)). We deduce that the matched exposition for the 
time band t is equal to M; (t) = M7 (t) — Mj (t — 1) with MẸ (0) = 0. The 
value of the carried forward CF; (t) can be obtained recursively by reporting the 
unmatched positions at time t which can be offset in the times bands 7 with 
T >t. The residual unmatched position is V; = max (£7 (7) S? (7)) — MF (t). 
We finally deduce that the required capital is the sum of the individual capital 
charges: 


i 6 
Ki = 1.596 x o x Mi o) + 0.696 x n o) +15% x N; 
t=1 t=1 


We notice that the matched position M; (t) is multiplied by 2, because we apply 

the capital charge 1.5% to the long and short matched positions. 

We compute the cumulative positions LF (t) and S; (t) and deduce the matched 

expositions M, (t): 
Time band _ 


LLA S(t) £i(t) Si) Milt) CZ) 


50 


t 

0—1M 1 0 300 500 300 300 200 

1M-3M 2 0 900 500 1200 200 700 
3M-6M 3 0 0 500 1200 0 700 
6M-1Y 4 1800 100 2300 1300 800 600 
1Y-2Y 5 300 600 2600 1900 600 300 
2Y-3Y 6 0 100 2600 2000 100 200 
3Y+ 7 0 200 2600 2200 200 0 
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The sum of matched positions is equal to 2200. This means that the residual 
unmatched position is 400 (2600 — 2200). At time band t = 1, we can carry 
forward 200 of long position in the next time band. At time band t = 2, we can 
carry forward 700 of short position in the times band t = 4. This implies that 
CF; (3) = 700 and CF; (4) = 700. At time band t = 4, the residual unmatched 
position is equal to 1000 (1800 — 100 — 700). However, we can only carry 600 
of this long position in the next time bands (300 for t — 5, 100 for t — 6 and 
200 for t — 1). At the end, we verify that the residual position is 400, that is the 
part of the long position at time band t — 4 which can not be carried forward 
(1000 — 600). We also deduce that the sum of carried forward positions is 2 700. 
It follows that the required capital is?: 


Ki = 1.596 x 4400 + 0.696 x 2700 + 1596 x 400 
$142.20 


2.4.2 Covariance matrix 
1. (a) We have: 
OA = V 314 = V4% = 20% 
For the other stocks, we obtain op = 22.36% and oc = 24.49%. 


(b) The correlation is the covariance divided by the product of volatilities: 


312 396 
Has) = : = = 67.08% 
p (Ra, Re) [Sx za 20% x 22.36% À 
We obtain: 
100.0096 
p= 67.08% 100.00% 
40.82%  —18.2696 100.00% 
2. (a) Using the formula X; j = p;,;010;, it follows that: 
1.0096 
£= | 1.00% 4.00% 
0.75% 0.00% 9.00% 
(b) We deduce that: 
a? (w) = 0.57 x 196 -F 0.5? x 4% +2 x 0.5 x 0.5 x 1% 


1.7596 


and e (w) = 13.23%. 
(c) It follows that: 


c? (w) 0.6? x 1% + (—0.4)? x 4% + 2 x 0.6 x (—0.4) x 1% 


0.52% 


and c(w) = 7.21%. This long/short portfolio has a lower volatility than the 
previous long-only portfolio, because part of the risk is hedged by the positive 
correlation between stocks A and B. 


3The total matched position is equal to 2 x 2200 = 4400 (long + short). 
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(d) We have: 


o? (w) = 150? x 1% +500? x 4% + (—200)? x 9% + 
2 x 150 x 500 x 1% + 
2 x 150 x (—200) x 0.75% + 
2 x 500 x (—200) x 0% 
= 14875 


The volatility is equal to $121.96 and is measured in USD contrary to the two 
previous results which were expressed in %. 


(a) We have: 


and: 


It follows that: 


cov(R) = 5 |(R- 2 [R]) (R — E [R M 


:[(e - Ete) (e - Ele" ] 
= ofBB +D 


c (Ri) = V 03-6? +6? 


We obtain c (R4) = 18.68%, c (Rpg) = 26.48% and o (Rc) = 15.13%. 


We deduce that: 


(b) The correlation between stocks 4 and j is defined as follows: 


c2 Bi; 
p(R;, R;) = CF ut ut EN 
(Fi R= z 08) (R) 
We obtain: 
100.00% 
p= | 94.62% 100.00% 
12.73% 12.98% 100.0096 
(a) We have: 
p(Z) = E[XiY] 
= E[Xj]E[Y;] 
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because X; and Y; are independent. For the covariance, we obtain: 


cov (Zi, Zj) = E[(Xi¥i) (X5Y;)] - E [X:Y;] E[X;Y;] 
= E[X;X;] E[Y;, Y;] - E[X;] E[X;] E [Y;] E [Y;] 
= (cov (Xi, Xj) + E[X;] E[X;]) x 
(cov (Yi, Y) + E[Y;] E [Y;]) — 
2 [X;] E[X;] E [Y;] E [Y;] 
= cov(X;,X;) cov (Yi, Y;)+ 
cov (X;, X;) E [Yi] E [Yj] + 
oe 2 [X;] E [X;] 
= Nj (X)X eo] 9 FX; j (X) ni (Y) Hj (Y) + 
Yi, Pu (X ) Hy (X) 


To obtain this formula, we use the fact that X; X; and Y;Y; are independent. In 
a matrix form, we find that: 


COV 


(Z) = n(X)on(Y) 
S(Z) = aie DW) 
X(X)ou(Y op(Y)' + 
z(Y)en(X)on(X)" 
(b) Using the numerical values, we obtain* u (Z) = 0 and: 
0.33396 
X(Z)- | 0.509; 1.33396 


0.188% 0.000% 3.000% 
The expression of the P&L is: 
II (w) = 1502, + 50022 — 20023 


We find that u (II) = 0 and c (II) = 69.79. We deduce that the Gaussian VaR 
with a 99% confidence level is equal to $162.36. For the Monte Carlo method, 
we use the following steps: (i) we first simulate the random variate X with the 
Cholesky algorithm; (ii) we then simulate Y with a uniform random generator; 
(iii) we calculate the components Z; = X;Y;; (iv) we finally deduce the P&L. 
With one million of simulations, we find that the Monte Carlo VaR is equal to 
$182.34. We explain this result because the distribution of II (w) is far to be 
normal as illustrated in Figure 2.2. 


2.4.3 Risk measure 


1. (a) We have: 
VaRa (L) = inf (£: Pr(L > £1 > a} 


and: 


ES, (L) = E [L | L > VaR, (L)] 


4We remind that the mean and the variance of the distribution U (0, 1) is 1/2 and 1/12. 
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0.008 p 


0.007 + Exact distribution 
===; Normal approximation 


0.006 


0.005 


0.004 


0.003 


0.002 


0.001 


0.000 


P&L (in $) 


FIGURE 2.2: Probability distribution of the P&L 


(b) We assume that F is continuous. It follows that VaR (L) = F~! (o). We deduce 
that: 


ESa (L) = E[L|L>F*(a)] 
=i fe) -a 
F-l(a) l— F(F^!(o)) 
= : xf (x) da 


l-a F-l(a) 


We consider the change of variable t = F (x). Because dt = f (x) dx and F (co) = 
1, we obtain: 


(c) We have: 


The non-centered moment of order n is?: 


12 
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co (641) 
up] = I 2^8 dr 


= 0 n 

= pan 
We deduce that: , 

[L| = =g 

L| = 3*4 
and: , 

I? = 2 

PA Saat 


The variance of the loss is then: 


var (D) = E |I?] - E? [L] = 


rz. is a scale parameter whereas 0 is a parameter to control the distribution tail. 


We have: 7 
—1 = 
i= (=) -— 
tH 


VaRa (L) = F`! (a) = z (1-0). 


We deduce that: 


We also obtain: 


1 1 —0-i 
T. 1 1-671 
= l-t 
l-a | 1—80-1 ( M 
0 —@-1 
= _(1 
p40) 
0 
> aaa VaRa 


Because 0 > 1, we have zi > l and: 
ESa (L) > VaRa (L) 


(d) We have: 


pog 1 1f/z—gWV 
ES, (L) = —/ x exp ( r) dz 
l-a ptob-l(a) oV2T 2 e 


5The moment exists if n Æ 0. 
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By considering the change of variable t = o^! (x — u), we obtain: 


1 ve 1 1 
ESa (L) = ae i + ot) =o ( 5e) dt 
H 


2g [® (t)]g (4) + 


eom [tes ( ie) 
(1— a) Vr $-1(a) p 2 
(0 


aoe | exp ( 22] eo 


ES aae ; 7 X") 


= br ca (a)) 


I 
= 


Because ¢’ (x) 2 —x@ (x), we have: 


i-ec) = foa 


= "i “(1 
D : 
We consider the integration by parts with u(t) = —t ! and v' (t) = Q (t): 


1-9(z) = | Zn [ oa 


t t2 
i: 2. a 5 (—t¢ (t)) dt 


We consider another integration by parts with u(t) = t^? and v' (t) =  (t): 


yeu 1. PU. EM T 3 A(t) dt 


x t3 t4 
_ lz) (x) T Aro) 
i x x f p? fide 


We continue to use the integration by parts with v'(t) = $ (t). At the end, we 
obtain: 


1-$(s = f +322) 35 + 
35.7012) _ 
= eu) ES EM 1)” (Ire 9) A 
_ (2) , VG) 
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5.0 


2.5 M 


2.0 


Probability (in Z) 


0.5 


0.0 


—— 1-4(x) 


= -— order 1 


s>s; order 4 


FIGURE 2.3: Approximation of 1 — ® (x) 


We have represented the approximation in Figure 2.3. We finally deduce that: 


ESQ(L) = 


II 


6(2) =r -80)) - £6) 
By using the previous expression of ES, (L), we obtain with z = ®~! (o): 
E aoa?! ^! (a)) 
HT dcm ) 
c E V (9-1 (a)) 

HT (1 Eb a) (: (a) (1 a) -1 (a) ) 

+00! (a [o = mn (o)) 
ptet (0-7 oy (a) 

V ($^! (a)) 

Veet) ceder a ER 


We deduce that ES, (D) > VaRa (L) because: 


For the Gaussian distribution, the expected shortfall and the value-at-risk coin- 
cide for high confidence level a. It is not the case with the Pareto distribution, 
which has a fat tail. The use of the Pareto distribution can then produce risk 
measures which may be much higher than those based on the Gaussian distribu- 


tion. 
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2. (a) We have: 


R(Liı + L2) = E[|Li + Lo] = E [L1] + E [L2] = R (Li) + R (La) 
R(AL) = [AL] = AE [L] = AR (L) 
R(L+m) = E[L—-m|-E[|L] - m 2 R(L) - m 


We notice that: 


C [L 35 za (o) = [7t t) dt 


— oo 


We deduce that if Fy (x) > Fo (x), then F1! (t) € F5! (t) and E[Li] < E [Ls]. 
We conclude that R is a coherent risk measure. 


(b) We have: 


R (Li + Lə) = E [Li + Lo] +o (Lı + L3) 
= E[|Lı] +E [Lə] + 
Vo? (L1) + 0? (L2) + 2p (L1, L2) a (14) o (L2) 


Because p (L1, L2) € 1, we deduce that: 


R (Lı + L2) < E[L] +E [L] + 
Vo? (L4) + 0? (L3) + 20 (L1) o (L2) 
< E[L +E [L2] + o (L1) 4- o (L2) 
€ R(Li) +R (L2) 
We have: 
R(AL) = E[AL] +o (AL) 
= AEL[L]|-4 Ac (L) 
— AR(L) 
and: 
R(L+m) = E[L—-m]-c c (L — m) 
= E([L|- m-oc(L) 
= R(L)—-m 


If we consider the convexity property, we notice that: 


R(ALi + (1—-A) £2) < R(ALı)+R((1-— A) L2) 
€ AR(Li) + (1— A) R (L2) 


We conclude that R is a convex risk measure. 


3. We have: 


fj 0 1 2 3 4 5 6 7 8 
Pr{L=¢;} 02 01 01 01 01 01 01 O1 0.1 
Pr{L <4} 02 03 04 0.5 06 0.7 0.8 0.9 1.0 
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(a) We have VaRsoy (L) = 3, VaRz5s; (L) = 6, VaRooy (L) = 7 and: 


3 x 1096 4- ... 3-8 x 10% 
ESsoy% (L) = : aU 5-55 
6 x 1096 4- ... - 8 x 10% 
ES75%(L) = : ape -=7.0 
7 x 10% +8 x 10% 
ESgoy,(L) = 2 5275 


(b) We have to build a bivariate distribution such that: 
F;' (o) + Fz! (o) < Frys (o) 


'To this end, we may use the Makarov inequalities. For instance, we may consider 
an ordinal sum of the copula C* for (u1,u3) € (o, o) and another copula Ca 
for (u1,u2) > (o, o) to produce a bivariate distribution which does not satisfy 
the subadditivity property. By taking for example a = 70% and Ca = C7, we 
obtain the following bivariate distribution: 


0 1 2 3 £ 5 6 7 CE Sg 
0.2 0.2 
0.1 0.1 
0.1 0.1 

0.1 0.1 

0.1 0.1 

0.1 0.1 

0.1] 0.1 

0.1 0.1 
0.1 0.1 
0.2 0.1 Ud 0.1 01 0.1 01-94 0l 


ONATKRWNR OS 


S 


We then have: 


Ü 0 2 4 6 8 10 14 
Pr(L,4-L;— 4) 02 01 01 01 OI O1 03 
Pr{Li +25 <4} 02 03 04 05 06 0.7 110 


Because Fy ' (80%) = Fz‘ (80%) = 6 and FT}, (80%) = 14, we obtain: 
F;! (80%) + F7! (80%) < F1, (80%) 


2.4.4  Value-at-risk of a long/short portfolio 
We note S44 (resp. Sp.) the price of stock A (resp. B) at time t. The portfolio value 
is: 
P; (w) = wA SAt + we Spt 
where wA and wp are the number of stocks A and B. We deduce that the P&L between t 
and t+ 1 is: 
II (w) — Pia P, 

= wa(Sat+i — Sat) + we (SB, t+1— Sp) 

= waSartRatsi + we SB¢RB t+ 

= WarRatsi t WgBgtRB t41 


6We have pı i = Pr {L1 = £j) and po,; = Pr {L2 = £i). 
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where RA,,1 and Rp, are the asset returns of A and B between t and t+ 1, and WA, 
and Wg, are the nominal wealth invested in stocks A and B at time t. 


1. We have WA, = +2 and Wey = —1. The P&L (expressed in USD million) has the 
following expression: 
IL(w) = 2RAa41 — Rp, 


We have II (w) ~ N (0, 0? (ID)) with: 


e(I) = wQoeAY + (Cen)! +2pa,p x (204) x (0n) 
= y4 x 0.202 + (—0.20)? — 4 x 0.5 x 0.202 
= v3 x 20% 
34.64% 


The annual volatility of the long/short portfolio is then equal to $346 400. We consider 
the square-root-of-time rule to calculate the daily value-at-risk: 


1 
VaRggs, (w; one day) = rie $-! (0.99) x V3 x 20% 


= 5.01% 
The 99% value-at-risk is then equal to $50 056. 


2. We use the historical data to calculate the scenarios of asset returns (R.4 441, RB,t+1)- 
We then deduce the empirical distribution of the P&L with the formula IL(w) = 
2HRA441 — Rg t+. Finally, we calculate the empirical quantile. With 250 scenarios, 
the 1% decile is between the second and third worst cases: 


VaR os; (w; one day) 


1 
— |-56850 + 5 (—54270 — (—56850)) 
— 55560 


The probability to lose $55 560 per day is equal to 1%. We notice that the difference 
between the historical VaR. and the Gaussian VaR. is equal to 11%. 


3. If we assume that the average of the last 60 VaRs is equal to the current VaR, we 
obtain: 


KIMA — me x V10 x VaRggs, (w; one day) 


K™A is respectively equal to $474877 and $527088 for the Gaussian and historical 
VaRs. In the case of the standardized measurement method, we have: 


pcSpecific _ 2x 8%+1x 8% 
$240 000 
and: 
KC General = [2 = 1| x 8% 
$80 000 
We deduce that: 
KSMM ix Kc Specific ae K General 


= $320000 
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The internal model-based approach does not achieve a reduction of the required capital 
with respect to the standardized measurement method. Moreover, we have to add the 
stressed VaR under Basel 2.5 and the IMA regulatory capital is at least multiplied by 
a factor of 2. 


. If pa,B = —0.50, the volatility of the P&L becomes: 


y4 x 0.202 + (—0.20)? — 4 x (—0.5) x 0.202 
= VT x 20% 


c (I) 


We deduce that: 


VaRa (pa,B = —50%) | co (I; pa,p = —50%) "B E 
zal 


VaRa (p4,p = 45096)  o (II; pa,p = +50%) 


The value-at-risk increases because the hedging effect of the positive correlation van- 
ishes. With a negative correlation, a long/short portfolio becomes more risky than a 
long-only portfolio. 


. The P&L formula becomes: 


IL(w) 2 W4 tRA t41 + WeeReesi — (Cassi — Cat) 
where C4, is the call option price. We have: 
CA = Cat ~ A; (SA 441 = S A4) 


where A, is the delta of the option. If the nominal of the option is USD 2 million, we 
obtain: 


IIl(w) = 2RA4A- Rg —2x0.5 x RA 
and: 
2 
c(Il) = (0.202 + (—0.20)* — 2 x 0.5 x 0.20? 
— 2096 


If the nominal of the option is USD 4 million, we obtain: 


II(w) = 2R4- Rg -Ax 0.5 x RA 
= -Rpg (2.2) 


and c (II) = 20%. In both cases, we have: 


VaR gs; (w; one day) x 671 (0.99) x 20% 


1 

v/260 
= $28900 
The value-at-risk of the long/short portfolio (2.1) is then equal to the value-at-risk of 
the short portfolio (2.2) because of two effects: the absolute exposure of the long/short 
portfolio is higher than the absolute exposure of the short portfolio, but a part of the 
risk of the long/short portfolio is hedged by the positive correlation between the two 
stocks. 
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6. We have: 
IL(w) = Wa RA — (CB t41 - Cp) 


and: 
CB t+1 — Cp; cA. (Sp. = Sp.) 


where A, is the delta of the option. We note x the nominal of the option expressed 
in USD million. We obtain: 


Il(w) = 2R4A-—-x*x Ax Rg 
x 
x 2RA = aks 
We have’: 
$ p ob DE z 
o (II) = 404+ 1 +2p4,B X (20,4) x (- ex) 
i 


= ee (2^ — 8pA,pt + 16) 


Minimizing the Gaussian value-at-risk is equivalent to minimizing the variance of the 
P&L. We deduce that the first-order condition is: 


Oo" (D 93 a 


We deduce that the minimum VaR is reached when the nominal of the option is 
x = 4pA, p. We finally obtain: 


OA 
c (II) 7 V 1603, s — 320% 5 + 16 


2044/1 -B 


and: 


1 
VaRgg% (w;one day) = T x 2.33 x 2x 20% x \/1— p^ p 


5.78% x 4/1- pp 


If pA,p is negative (resp. positive), the exposure z is negative meaning that we have 
to buy (resp. to sell) a call option on stock B in order to hedge a part of the risk 
related to stock A. If p4,B is equal to zero, the exposure x is equal to zero because a 
position on stock B adds systematically a supplementary risk to the portfolio. 


R 


2.4.5 Value-at-risk of an equity portfolio hedged with put options 
1. Let R = (Ra, Rg) be the random vector of stock returns. We remind that?: 


cov (R) = o? (Rr) 8B! +D 


where 6 = (84, 6g) and D is the covariance matrix of idiosyncratic risks. 


TBecause c4 = og = 20%. 
8See Exercise 2.4.2 on page 8. 
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(a) We deduce that o (Rj) = 4/820? (Rr) + 65. We obtain 
o (Ra) = V0.5? x 4% + 3% = 20% 


c (Rp) = V1.52 x 4% + 7% = 40% 


The cross-correlation is: 


and: 


= c? (Rr) Babs A 4% x 0.5 x 1.5 = 
p (Ra, Re) = c(Ra)c(Rg)  20%x40% d 


(b) To find the correlation between the stocks and the index, we can proceed in two 
different ways. 


i. We consider the random vector R = (R4, Rp, Rr). The formula cov (R) = 
c? (Rr) B8' + D is still valid with 63 = By = 1 and D33 = 67 = 0%. We 


obtain: 


_ 0? (Rr) Babr | 4% x0.5x1 _ 
p (Ra, Fa) = 7 0A) o (Rr) EC A — 5096 


and: 
| G?(Rri)BsBr 4% x15x1 _ 


ii. The definition of beta 5; is: 


_ cov(R;,Rr)  p(R;, Rr) o (Rj) o (Rr) 


LE qi o? (Rr) 
It follows that: (Ry) 
Fir g I d 
PU Bi) = Tg yh 


We retrieve the previous formula. 


(c) The correlation matrix is then equal to: 


100.0% 
p= | 37.5% 100.096 
50.0% 75.0% 100.0% 


We deduce that the covariance matrix X is: 


4% 
y= | 3% 16% 
2% 6% 4% 


2. Let w = (wa, we, wr) be the composition of the portfolio. The expression of the P&L 
between t and t£ + h is: 


Il(w) = wa (Saztn— Sat) + we (SBt4+n — Spt) + wa (Srttn — Sra) 
= waSartRattn + wBSpailigaun + WISI RI t+h 
= WARAst+n t+ Wekettn + WRI t+h 
= W' Rin 
where W; is the current wealth invested in asset j, W = (Wa, Wg, Wr) is the vector 
of dollar notionals and Ri+n = (Razin, RB t+h, RI,t+h) is the random vector of asset 
returns. 
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(a) We have W4 = 400, We = 500 and W; = 250. We deduce that: 
o’ (Il) = W'xW 
= 400? x 4% + 500? x 16% + 250? x 4% + 


2 x 400 x 500 x 396 + 2 x 400 x 250 x 2% + 
2 x 500 x 250 x 696 


We find that c (II) is equal to $28267. Using the square-root-of-time rule?, it 
follows that: 


2 
VaRggs, (w; ten days) =  -!(9996) x 282.67 x ETT 
$128.96 


I 


The 99% quantile corresponds to the 2.6" order statistic of the sample. The 
historical value-at-risk is then the interpolated value between the second and 
third largest losses: 


P EE 
= 


VaRggos, (w;one day) = 55.23 — (2.6 — 2) x (55.23 — 52.06) 
— $53.33 


We deduce that the 10-day VaR is: 


VaRggs, (w; ten days) = v10 x VaRog% (w; one day) 
$168.64 


(c) If we assume that the average of the last 60 VaRs is equal to the current VaR, 


we obtain: 
K™A = m, x VaRggs, (w; ten days) 
K1MA is respectively equal to $387 and $506 for the Gaussian and historical 


VaRs. In the case of the standardized measurement method, we have!?: 


Specific .— (W4 + Wp) x 8% + Wr x 4% 

= 900 x 8% + 250 x 4% 

= $82 
and: 

General — |Wa+ Wet Wr| x 8% 
$92 
We deduce that: 
KSMM a KcSpecific E K General 
$174 


9We use the following correspondence: 10 days is equivalent to 2 weeks and one year is equivalent to 52 
weeks. 
10We assume that the specific capital charge for an equity index is 496. 
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3. Let z be the number of put options. The expression of the P&L becomes: 


Il (w) = WARA, t+n + WeRe tin + WrRrcun + v (Pis Pr) 


where P, is the value of the put option at time t. Under the delta approach, we have: 


Pon —P ~ A. (Srt+h oa S; 4) 
= ASIRI t+h 


We deduce that: 


II (w) = WAR Acn + WeRptt+n + (Wr + LASI) Rin 


Using the numerical values, we obtain: 


(a) 


II (w) = 400R4t+n + 500R pci + (250 — 12.52) x Rretn 


To hedge 50% of the index exposure, the number of put options must satisfy the 
following equation: 
250 — 12.5r = 125 


'The portfolio manager must purchase 10 put options. In this case, the expression 
of the P&L becomes: 


II (w) = 400R 4 t+h + 500 p t+h + 125Rrtth 


and the 10-day Gaussian VaR. is equal to $119.43. 
We have: 


II(w) = 400Rat+n + 500RB44n + (250 — 12.52) x Rr t+h 
= 400(BaRrt+n +€attn) + 500 (8p Rr t+h + eBttn) + 
(250 — 12.52) x Rr t+h 
= (1200 — 12.52) x Rrign + 400 x £A tin +500 x EB t+h 


As the index return is not correlated with the idiosyncratic risks, minimizing the 
VaR is equivalent to minimizing the beta exposure in the index: 


1200 


doc 


'The purchase of 96 put options is required to remove the directional risk. In this 
case, the P&L reduces to: 


TI (w) = 400 x £A,t+h + 500 x EB tin 


and its volatility becomes: 


c (II) = /400? x 3% + 5002 x 7% = $149.33 


We deduce that the minimum 10-day VaR is equal to $68.13. In Figure 2.4, we 
show the evolution of the VaR with the number of purchased options. We verify 
that the minimum is reached for x = 96. 
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VaR (in $) 


0 25 50 75 100 125 150 175 200 


Number of put options 


FIGURE 2.4: Value of the 10-day VaR with respect to the number of purchased options 


2.4. Risk management of exotic derivatives 
Let C; be the option price at time t. The P&L of the trader between t and t + 1 is: 
T= — (Cy41 — Ci) 


The formulation of the exercise suggests that there are two main risk factors: the price of 
the underlying asset 5; and the implied volatility X+. We then obtain: 


II = C; (St, Se) — Cua ($5, Xu) 
1. We have: 


H 
l 


Ci (St, X) — Cua (St+1, Et+1) 
1 
—4; (St+1 = St) — git (Si = Si)? — Ut (Eua = Y) 


2 


Using the numerical values of A;, T; and v;, we obtain: 


1 2 
Il & —0.49 x (97 — 100) — 5 x 0.02 x (97 — 100) 
= 147-0.09 
1.38 


We explain the P&L by the sensitivities very well. 
2. We have: 
Il — Cua (Sia, Ytri) — Cu (St42, Yta) 
—Awca (St — Sta) - -m (S142 — St41)? — 
+1 (Zi — Xia) 


Q 


v 


+ 
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Using the numerical values of Az41, F441 and v4.41, we obtain: 


II 


2 


1 
—0.49 x 0 — 5 x 0.02 x 0? — 0.38 x (22 — 20) 
= -0.76 


To compare this value with the true P&L, we have to calculate C41: 


Cii = Ce (Ci — Cura) 
= 6.78 — 1.37 
5.41 
We deduce that: 
II = Carm Cu 
= 5.41 -— 6.17 
—0.76 


Again, the sensitivities explain the P&L very well. 


. We have: 
TE =  Cu2(Su2,3X442) — Crus (S103, X63) 
1 
& —Ar2 (St+3 — 945) — zri (St+3 — Sa) = 
Ur42 (L443 — X42) 


Using the numerical values of A45, F442 and vi425, we obtain: 


1 
II œ —044 x (95 — 97) — 5 x 0.018 x (95 — 97)? = 


0.38 x (19 — 22) 
= 0.88 — 0.036 4- 1.14 
= 1.984 


The P&L approximated by the Greek coefficients largely overestimate the true value 
of the P&L. 


. We notice that the approximation using the Greek coefficients works very well when 


one risk factor remains constant: 


(a) Between t and t+1, the price of the underlying asset changes, but not the implied 
volatility; 


(b) Between t+ 1 and t 4- 2, this is the implied volatility that changes whereas the 
price of the underlying asset is constant. 


Therefore, we can assume that the bad approximation between t+ 2 and t+ 3 is due 
to the cross effect between S, and X. In terms of model risk, the P&L is then exposed 
to the vanna risk, meaning that the Black-Scholes model is not appropriate to price 
and hedge this exotic option. 
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2.4.7 P&L approximation with Greek sensitivities 


1. We note C, = C; (Si, 34, T) meaning that the option price depends on the current 
price S+, the implied volatility X, and the maturity date T. The delta of the option is 
the first derivative of C, with respect to S+. 

OC; (St, 34, T) 

Ap ORE HO y 

' ð S, 

whereas the gamma is the second derivative: 

ae 9? C, (Si, X T) 

m OS? 

The theta of the option is the first derivative of C; with respect to the time t. We 

have: 


I, 


©; = OF Cr (St, Le, T) 
For the vega coefficient, we have: 


— OC; (St, X4, T) 
B 9 Y, 


Ut 
2. Let r; and b, be the interest rate and the cost-of-carry parameter. We note 7 =T —t 
the residual maturity. The Black-Scholes formula is: 
C; = Spet") (di) —_Ke "™® (d2) 
with: 


1 S: 1 
= saa (gether) +5 wr 


dg = di-*Xvyr 


To calculate the Greek coefficients, we need the following preliminary result: 


Ke™T o (d2) = S,e 77979 (d) (2.3) 
Indeed, we have: 
(də) = óé(d svT) 
1 1 2 
= xw 7 (di Yay) ) 
1 1 
- ep ( 5 (a — 2d Sev + zi) 
1 
= (di) exp (avr - 522r) 
S 
= d(di)exp (1m x + br) 
S 
== eee (di) 
The derivation of Equation (2.3) is then straightforward. We deduce that: 
Od Od 
A, = et") (dy) + S,e(7797 9 (di) PES — Ke^"** à (dg) 28, 
Od Od 
= el) (dj) + Spel) o (di) ae: — Ke™" o (d) PR 
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We have: 
Od, Ode 1 


0S, O08, XiT 


We finally obtain: 


S,e(^i-707 6 (di) — Ke-"t7 e (d2) 


A, = eto (d) 4 So VE 
tot 


= e% (di) 
The expression of the gamma is therefore: 


eO 7707 9 (di) 


T = 
' SX VT 
To calculate the theta, we first calculate the derivative of dı and də with respect to T: 
Odi | 1 In St bi Qi 
Or ^. Orr K 2Xr AyT 
ð d» — 1 In St b, dy 
Or o — 2XQr K YT 4yr 


We deduce then: 
ô- C, (St, Xu T) = (b, D rt) S,e-r0T @ (di) + rıKe "76 (d2) + 


Od Od 
S777 $ (d1) 57 — Ke" o (da) 5 


= (b, E rt) S,e i-r) ap (di) + raKe "70 (d2) + 
Odi Ode ) 


seen (SF 5 


= (b, — rt) ENDE (di) t+rKke 8705 (d2) + 


» » 
(b. —rs)T t t 
= (b, = rt) Sret) TD (di) t+rKke 770 (d2) + 
1 
5; 93e "e (di) 


The expression of the theta coefficient is then the opposite of 0; C, (Sz, X4, T). For the 
vega coefficient, we obtain: 


v = Se™= o (d1) SE — Ke" 0 (da) a 
-oseecmsen (s - s) 
We have: 
Di o cse (mtn) eiv 
-i sre (nz br) 3⁄7 


It follows that: 
vi = SeVrel"™7 6 (di) 
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3. (a) To calculate Co, Ao, To and Oo, we consider the Back-Scholes formulas with b; = 
ri. We have C, = S;® (di) — Ke~"™7® (da), A, = 6 (di), Ti = Q (d1)/ (Sj3a T), 
©, = —ruKe "7 ® (dg) — $4349 (d1)/ (2/7) with the following numerical values: 
Sı = 100, X, = 20%, r = 1 and r; = 5%. We notice that the option price is a 
decreasing function of the strike, because it is a convex function of the strike. 
'The delta and gamma coefficients are positive. T'he delta is a decreasing function 
with respect to K. This is not the case of the gamma. The theta of the call option 
is negative, because the time value decreases with the residual maturity. 


uiri. 
e 


We apply the Black-Scholes formula C, = S;® (d1)— Ke "** 6 (dz) with S, = 102, 
Y, = 20%, r, = 5% and T = 1 — 1/252 because the residual maturity is one year 
minus one trading day. We deduce that the P&L of a long position on this option 
is II = Ci m Co: 


K 80 95 100 105 120 
II 1.852 1.464 1.285 1.099 0.589 


(c) We obtain the following results: 


K 80 95 100 105 120 
II^ 1.857 1.456 1.274 1.084 0.574 
wath 1.871 1.489 1.311 1.124 0.608 
[^79 1.839 1.432 1.249 1.060 0.556 
IIA4^"*9 1.853 1.465 1.287 1.100 0.590 


The approximation of the P&L by the Greek sensitivities is very accurate. 


(d) We obtain the following results: 


K 80 — 95 100 105 120 
JI —— 45.386 42.001 40.026 37.596 28.090. 
i^ 44.575 34.930 30.568 26.027 13.785 


Att 52.424 54.058 52.182 48.877 33.412 
Ie 42.186 31.793 27.361 22.888 11.445 
I[A^"*9 50.036 50.912 48.975 45.739 31.071 


In this case, the approximation of the P&L by the Greek sensitivities is not very 
good. Indeed, the remaining maturity is now six months meaning that (1) the 
theta effect is not well measured and (2) the price of the underlying asset has 
changed significantly. In this situation, the delta P&L is overestimated. 


2.4.8 Calculating the non-linear quadratic value-at-risk 


1. We have: 


Hx] = Fi BG ae) ae 


— oo 


E s "lag (x) dx 
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Using the integration by parts formula, we obtain!!: 


-Foo 
5X oem [727^-16 (a)] ** + (2n — Df z2^—29 (x) dz 


I 


+00 
(2n — yf z2^—26 (x) dz 


— oo 


= (2n-1)E[X?"7] 


We deduce that E[X?] = 1, E[X4] = (2x2-1)E[X?] = 3, E[X®] = 
(2x3-1)E[X*] = 15 and E[X5] = (2x 4— 1) E[X^] = 105. For the odd mo- 
ments, we obtain: 


+00 
< [Aem - i gent ó (2) dr 


because z?"*16 (x) is an odd function. 
2. Let C; be the value of the call option at time t. The P&L is equal to: 
IL(w) = Ci, — €i 
where h is the holding period!?. We also have Styn = (1 + Ri+n) St where Ri+n is the 
asset return. We notice that the daily volatility is equal to: 
_ 32.2576 


= V860 : 


We deduce that R45, ~N (0,4 bps). 
(a) We have: 


II (w) Au (Sui — St) 


A45, Ri+n 
It follows that II (w) ~ M (0, A26?52) where c is the volatility of Ry.) and: 
VaR q (w; h) = 9^ (o) Ail o S; 


Q 


II 


The numerical application gives VaRa (w; h) = 2.33 dollars. 
(b) In the case of the delta-gamma approximation, we obtain: 


1 
Il (w) a A: (Stn — St) + jt: (St+h — Sp 
1 
= Aquae + 5 FRE S; 


We deduce that: 


: 1 
[I] = Ai Ris nSe + seein St 


I 


sree Hos | 9? 


1 
ghee Si 


l1because ¢! (x) = —a¢ (x). 
12Here, h is equal to one day. 


nN 
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and: 


E[(IP = E 


1 2 
(aiias * DNE) | 


: 1 
= E [aint TAGES? LE] 


We have Rin = oX with X ~ N (0,1). It follows that: 


3 

5 [117] = A?o* S? + ilio Si 

because E[X] = 0, E[X?] = 1, E[X?] = 0 and E[X*] = 3. The standard 
deviation of the P&L is then: 


c (II) 


Il 


3 1 
i A025} + P Tess] - (rs) 


1 
= "T + 5 Fio Sr 


Therefore, the Gaussian approximation of the P&L is: 
1 1 
II (w) ~ N (Gries? Ajo? S? + To's} 


We deduce that the Gaussian value-at-risk is: 


1 1 
VaR, (w; h) = -3T:78; 4-9 (a) T + Fre 5; 


The numerical application gives VaRq (w; h) = 2.29 dollars. 


Let L — —II be the loss. We recall that the Cornish-Fisher value-at-risk is equal 
to (FRM, page 88): 


VaRo (w; h) = u (L) + à (a; 71 (D) ya (2)) -o (D) 
with: 


à(o; 09) = Zad 5 (2a 1) 1 4 2 (23 — 324) 34 — 


Bed 
and z = $-(a). yı et y2 are the skewness and excess kurtosis of the loss L. 
We have seen that: i 
II (w) = Ayo SX + zro SEX 


with X ~ N (0,1). Using the results in Question 1, we have E [X] = E [X°] = 
| [X] = E [X7] =0, E[X?] = 1, E [X4] = 3, E[X®] = 15 and E [X*] = 105. 
We deduce that: 


E[P| = E [ates + jAm,etsx + 


1 

j | A iet + Tietse x"] 
9 15 

= 5 ATi Se + gli 5 
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and: 
1 4 
CI] = E (Aes + jets) | 
45 105 
= 3Ajo*8;-- z Argot St + ig i05 
The centered moments are then: 
) (u _E mp? = E [n?] — 3E [rn] E [r?] + 2? (rrj 
9 15 3 
= 5 AtTio* S; + lie’ st — 5 AtTio* S; = 
9 


2 
-I369588 + grin Sr 


= R + Toss 
and: 
) [ui s :[u*| = E [D$] — 4E [n] E [n°] + 6E? n] E [r?] — 
3E* [I1] 
= 3Afotst + B AIT 70°S? + S Ties? = 
9AIT}0°S? — PT 48s} + 
SAIL 7o°S? + Ties — Š rios 


15 
= 3A1c018;-- 15A2?T759 SÉ + Gos 


It follows that the skewness is: 


*4(DL = -y (I) 
: [ar - E qm? 
H c? (II) 


3A2T,0^8; + T2085 
(A2o?S2 + 1p294g2)?? 
6V2A7T 0454 + 2/21 70°98 
(2A260?82 + T20454)3/? 


whereas the excess kurtosis is: 


ya(L) = (I) 
[ar - :[u*| 
^ wq 


3A10*5? + 15A2T7685$ + T1035? 
(A202582 + ir20492)? 

12A2T2655$ + 30 40° S8 

(A2o2S? + 4920454)? 
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Using the numerical values, we obtain u (L) = —0.0400, e (L) = 1.0016, yı (L) = 
—0.2394, y2 (L) = 0.0764, 3 (a; 51, 72) = 2.1466 and VaR, (w; h) = 2.11 dollars. 
The value-at-risk is reduced with the Cornish-Fisher approximation because the 
skewness is negative whereas the excess kurtosis is very small. 


(a) We have: 
X'AX 
= (sry 51/2 Ay/2 (Sex) 


XTAX 
with A= X12AY12, X ~N (n, Ù), i = X? and È = I. We deduce that: 


[Y] = jA'Aü-tr(A) 
= pT Ap + tr (x? Ax?) 
= u'Au+tr(AX) 
and: 
var(Y) = E[Y?]- E?^[Y] 


= 4A Å i+ 2tr (A’) 
= 4u" ADAp + 2tr (zi Ax Ax?) 


= 4u AXAg-2tr ((4z)’) 


(b) For the moments, we obtain: 


3Y] = tr(A¥) 
[Y] = (tr(AD))? +2tr (Ax?) 

£[Y?] = (tr (A£)? 6t (A2) tr (CA?) 86 (Cx?) 
[Y] =  (tr(Ad))* 32 t (AX) tr ((A)) + 


2 (tr Cai + 12(tr(AE))? tr ((AEZ)*) + 
48tr ((AD)*) 


It follows that the first and second centered moments are u (Y) = tr (AX) and 
var (Y) = 2tr ((Az)’). For the third centered moment, we have: 


[Y -EQp?) = E[v?]-sE[Y?] EY] - 2E? [Y] 
= (tr (AX)? + 6tr (AE) tr T ji 
s(n") -s 
6tr((4 z)’) tr ( tr (AX) + 2 (tr (AX)? 
») 


= str ((A9 $* 
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The skewness is then equal to: 


n(Y) = m 
3 


We obtain for the fourth centered moment: 


[er - p^ = E[Y*] -4E[Y?] E [Y] + 6E [Y?] EY] - 
3E" [Y] 
= (tr (AX))* + 32tr (AX) tr ((45)*) 4 


( 
12 (tr (rax?) + 48tr ((ax)*) 


12 (tr (AB)? tr ((AB)*) — 4 (tr (A2)* - 
24 (tr (AX)? tr (Cx?) = 
32 tr ((AX)? ) tr (AD) + 6 (tr (A£))f + 


(A^) 
12tr ((Ax)?) (tr CAE)? — 3 (tr (AE)) 
= 12 (tr Copi +48tr ((4z)*) 
It follows that the excess kurtosis is: 


w(¥) = 12 (tr (dam?) +48 (can?) 3 


(eu (am) 


12tr ((Ax5*) 
m) 


II (w) =w! (Cin — Ce) 


4. We have: 


where C; is the vector of option prices. 


(a) The expression of the P&L is: 


H(w) ~ w (Aro (Siya — St)) 
= w ((Azo S+) o Rith) 
= Al Reh 


where A, is the vector of delta exposures in dollars: 


Ait = Wi Mi Sit 


— 
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Because Ri+n ~ N (0, X), it follows that II ~ NV (0, A/ XA;). We deduce that 
the Gaussian value-at-risk is: 


VaR (w; h) = 97! (o) / A XA; 


'The risk contribution of option i is then equal to: 


-t (a) (ZA) Aiii 


A} A, 
Ait (XA), 
= $^! (a) it ( e); 


VAIXA, 
In the case of the delta-gamma approximation, we obtain: 
II(w) & w (Aro (Sn $0) + 
ju (T o (Sian — Se) 0 TRE $)') w 


$ 1 z 
= ÅJ Run+ PLUR Res 


where T; is the matrix of gamma exposures in dollars: 
Pi jt = wiwjTiji9i4554 


We deduce that: 


"A 1 L 
) [LT] = i [AT Ress + 5 aT Res 


[Rial Rita] 


NIe Ot 


tr (Td) 


and: 


pm? 


var (II) = 
ae Pcs xe dre es i 
A, Fun + 5 Pia dtes D 2 tr (T5) 


[u 
- | 

(A 

[(A 


= 1 x - 
t Ris) | + 4 y [CN pnl tRith — tr (£i) + 


Ri+n) Rip Rak —tr (£,35))] 


| ( 


t 


"ES 1 a 
= E[(AT Rua) | + 7 var (RI Ren) 


~ ~ 1 ~ 2 
= AEA, +5 tr((f2)’) 
Therefore, the Gaussian approximation of the P&L is: 
1 ~ ~ ~ 1 ~ 
IL (w) ~N G tr (Č.2), AEA, + zt (x) 


We deduce that the Gaussian value-at-risk is: 


VaRa (w; h) = -i tr (F£) 4 $^! (a) ATA. + ju ((.)*) 
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(c) If the portfolio is delta neutral, A; is equal to zero and we have: 
lor z 
II g Fia Pies 


Let L — —II be the loss. Using the formulas of Question 3(b), we obtain: 


u (D) = -; tr (Ñ) 
o (L) = 5 tr (fv) 
2V/2tr (x) 
Jı (L) = es^ 
"T 126 ((Pv9)*) 


MULT 
(tr ((f.2)*) ) 
We have all the statistics to compute the Cornish-Fisher value-at-risk. 


(d) We notice that the previous formulas obtained in the multivariate case are per- 


fectly coherent with those obtained in the univariate case. When the portfolio is 


not delta neutral, we could then postulate that the skewness is!?: 


6V2AT EDDA, + 2v2tr (Eu)? 


E (2À7XA, Jtr ((5)?)) ^ 


In fact, it is the formula obtained by Britten-Jones and Schaeffer (1999). 


Or 
WEN, 

v 
— 


Using the numerical values, we obtain (ZL) = —78.65, o (L) = 88.04, 44 (D) = 
—2.5583 and y2 (L) = 10.2255. The value-at-risk is then equal to 0 for the delta 
approximation, 126.16 for the delta-gamma approximation and —45.85 for the 
Cornish-Fisher approximation. We notice that we obtain an absurd result in the 
last case, because the distribution is far from the Gaussian distribution (high 
skewness and kurtosis). If we consider a smaller order expansion: 


1 1 
à (05,2) = Za + 6 (25 = 1) aF 24 (23 = 3za) ^ya 


the value-at-risk is equal to 171.01. 


(b) In this case, we obtain 126.24 for the delta approximation, 161.94 for the delta- 
gamma approximation and —207.84 for the Cornish-Fisher approximation. For 
the delta approximation, the risk decomposition is: 


Option Wi MR; RC; RC; 
1 50.00 0.86 42.87 33.96% 
2 20.00 0.77 15.38 12.19% 
3 30.00 2.27 67.98 53.8596 
OR(w) -. 12624 


13You may easily verify that we obtained this formula in the case n = 2 by developing the different 
polynomials. 
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For the delta-gamma approximation, we have: 


Option Wi MR; RC; RC; 
1 50.00 4.06 202.92 125.3196 
9 20.00 1.18 23.62 14.59% 
3 30.00 1.04 31.10 19.2196 

JR(w) -. 16194 = 


We notice that the delta-gamma approximation does not satisfy the Euler de- 
composition. 


2.4.9 Risk decomposition of the expected shortfall 
1. We have: 


It follows that: 


with u (w) = w' u and o (w) = Vw! dw. 


2. The expected shortfall ES, (w; h) is the average of value-at-risks at level a and higher: 


ESq (w; h) = E[L | L > VaRa (w;h)] 
We know that the value-at-risk is: 
VaR, (w; h) = —w' u + 671 (a) Vw! Sw 


We deduce that: 


senri fate (-4(SH2)) e 


where x = —u(w) + ^l(o)o(w). With the change of variable t = 
o(w)' (x + pi (w)), we obtain: 


E ee xc EN (=u (0) +0 (w) i) exp (°) dt 
= HO) eet 


= -p(w)+ we | exp ( 7). e 
= =o) + A ep (Ee?) 


The expected shortfall of portfolio w is then: 


ES, (w; h) = -w'u + e OU wi dw 
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3. The vector of marginal risk is defined as follows: 


MR = ð PS (uh h) 
a _—o(@ *(a)) Xw 
"T (=a) VT Ew 


We deduce that the risk contribution RC; of the asset i is: 


—Wipi + 


It follows that: 


SRC: = 3 Wili 4 o (7! (a)) Wi X (£w); 


ici (1-a) vri 
vut PUn 9) uf Gee) 
b (1-a) Vwldw 
= ES, (u;h) 


The expected shortfall then verifies the Euler allocation principle. 
4. We have: m " 
-Sun-Yu 
i=1 i=1 
with L; = —w;Rj;. We know that: 
RC; = ` [L; | L > VaRa (w; h)] 
2 [Li -1 {L > VaRa (w; h)} 


i [1 {L > VaRa (w; h)}] 
i [L;i - 1 {L > VaRa (w; h))] 


l-a 
We deduce that: 
RC: = a L[R; - L {R (w) € — VaRa (w; h)}] 
We know that the random vector ( (w)) has a multivariate normal distribution: 


We deduce that: 


(nt )^ «(C 3I) 
(nte UM 316799) 


Let J = E[R;-1(R(w) < — V: w;h)}]. We note f the density function of the 


( 
random vector (Fi, R (w)) and p = X; “1/2 (w Sip a (Sw), the correlation between 
R; and R (w). It follows that: 


I = f. [asm (w; h)) -rf (r,s) drds 


— oo 


Loo p—VaRa(w) 
/ / rf (r,s) dr ds 


I 
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Let t = (r — mi) /A/Xi; and u = (s — w! u) /Vw' £w. We deduce that": 


to qe 0-8 ys + Ut t? + u2 — 2pt 
r= fo D Hi E: So ( M oe dt du 
Qnv/1 — p? 2(1-— p?) 


By considering the change of variables (t, u) = ọ (t, v) such that u = pt + y1 — g?v, 
we obtain!” 
to 9) iy + Yat ro 
I = J Í la — exp ae dt du 
E ET 2m 2 


+œ p9(0 4 12 2 
E uf i: ze ( T) atau + 
--oo g(t) t t2 2 
a> /. ze ( =) atau + 
= ph y Mil» 


where the bound g (t) is defined as follows: 


For the first integral, we havel? 


Sg i? ay “1 v? 
h = e —— e ——|d dt 
i —oo v2m Sa ( 2 ) Lb v2n aE ( 2 ) i 


foe (scc o (t) dt 
S 1 — p? 


= l-a 


II 


The computation of the second integral I> is a little bit more tedious. Integration by 
parts with the derivative function tø (t) gives: 


une $-!(1— o) — pt 
b J. CERT to (t) dt 
2 (2 1(1— a) — pt 
ENTE 1—p? 
nlla SÉ xps f gp ee een) 
= Perdu a ael UI Ja 
= -pé(9 (0-2) 


Because we have $7! (1 — a) = —-! (o). 
15We use the fact that dtdv = 4/1 — p? dt du because the determinant of the Jacobian matrix containing 


the partial derivatives Dy is 4/1 — p2. 
$-1(p)— pT 
Elo (p) =P E 
/1-p? 


IE 


16We use the fact that: 


where T ~ N (0, 1). 
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We could then deduce the value of I: 


I = m(l-a)- pyi (8 (1 -a)) 
g p (=a) - 29 (8 (a) 


We finally obtain that: 
(9! (a)) wi x (Zu) 
(1-a)  vVwTXw 


We obtain the same expression as found in Question 3. Nevertheless, the conditional 
representation is more general than the Gaussian formula, because it is valid for any 
probability distribution. 


RC; = —wipi + $ 


2.4.10 Expected shortfall of an equity portfolio 
1. We have: 


II = 4(Pattin — Pat) + 3(PBt+h— Pp) 
= A4PA4RA t+n + 3Ppa Ran 
400 x Rasen + 600 x Raven 


where RA +n and Rp, are the stock returns for the period [t,t + h]. We deduce 
that the variance of the P&L is: 


400 x (2596)? + 600 x (20%)? + 
2 x 400 x 600 x (—20%) x 25% x 20% 
19 600 


c? (II) 


We deduce that c (II) = $140. We know that the one-year expected shortfall is a linear 
function of the volatility: 


o (9! (a)) 
l—o« 


= 2.34 x 140 
$327.60 


ES, (w; one year) x c (II) 


The 10-day expected shortfall is then equal to $64.25: 


/ 10 
ES, (w;ten days) = 266 * 327.60 


$64.25 


2. We have: 
II, = 400 x Ry, + 600 x Rgs 


We deduce that the value II, of the daily P&L for each scenario s is: 


s 1 2 3 4 5 6 7 8 
II, 36 —10 24.96 -12 -30 -l4 —16 
IT ou —36 —30 —26 -24 -16 -14 -12 S10 
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The value-at-risk at the 97.5% confidence level correspond to the 6.25" order statis- 
ticl”. We deduce that the historical expected shortfall for a one-day time horizon is 
equal to: 


ES. (w;one day) = —E/[II | II € — VaR, (II)] 
1 
d 5 Hs:250 
s=l1 


1 
= | (36 +30 +26 +24 + 16 + 14) 
= 24.33 


By considering the square-root-of-time rule, it follows that the 10-day expected short- 
fall is equal to $76.95. 


2.4.11 Risk measure of a long/short portfolio 
We have: 
Mith = 2(Paten — Pat) —5(Pettn — Pp) 
= 2P4atRattn —5PB+RB +h 
100 x (Rat+n — RB t+h) 


where Razin and Rpg,t+n are the stock returns for the period [t,t + h]. 
1. We deduce that the (annualized) variance of the P&L is: 


c? (Is t4260) = 100? x (2596)? + 100? x (2096)? — 
2 x 100? x 12.596 x 2596 x 2096 
— 900 


We have c (IL,141260) = $30. It follows that the 10-day standard deviation is equal to: 


10 
o (IMit+10) = 260 * 7 (Ii, t+260) 
= $5.883 
(a) We obtain: 
VaRggs, (w;ten days) = 71 (99%) x c (I t+10) 
$13.69 
(b) We have: 
ot 
ES, (w; ten days) = eR Cn) x o (Ili t+10) 
—a 
and: 


o (9-1 (97.5%)) 


A 
1 — 97.5% So 


c 2.34 


The 10-day expected shortfall is then equal to $13.75. 
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TABLE 2.1: Order statistic I.:250 of the daily P&L 


E 1 2 3 4 5 6 7 8 9 10 
IL;.250 6.3 6.0 5.1 4.8 —4.6 —4.5 4.3 4.3 4.0 3.9 


2. Given the historical scenario s, the one-day simulated P&L is equal to: 


II; = 100x (Ras — Rp) 
= 100x D, 


The order statistic II5.559 of the daily P&L is given in Table 2.1. 


(a) We deduce that the one-day value-at-risk at the 99% confidence level corresponds 
to the 2.5"¢ order statistic: 


—6.0— 5.1 


VaR os; (w; one day) = — ( 2 


) — $5.55 


It follows that: 
VaRgo; (w; ten days) = v10 x VaRog% (w, one day) = $17.55 


(b) The value-at-risk at the 97.5% confidence level correspond to the 6.25*^ order 
statistic. We deduce that the historical expected shortfall for a one-day time 
horizon is equal to: 


6 
1 
ESo7.5% (w; one day) -§ X II5:250 
s—1 


1 
= 3 (6.3 + 6.0 + 5.1 + 4.8 + 4.6 + 4.5) 
= $5.22 


By considering the square-root-of-time rule, it follows that the 10-day expected 
shortfall is equal to $16.50. 


In Basel II, the capital charge is equal to: 


— 
e 
WN 


K 3 x VaRooy (w; ten days) 


$52.65 
In Basel 2.5, the capital charge becomes: 


K 3 x VaRogy (w; ten days) +3 x SVaRg9% (w; ten days) 


$157.96 


Il 


where SVaR is the stressed value-at-risk. In Basel III, we obtain: 


K = 2x SESg9% (w; ten days) 
$65.99 


where SES is the stressed expected shortfall. 


17We have 2.5% x 250 = 6.25. 
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2.4.12 Kernel estimation of the expected shortfall 
1. We have: 


IX -1{X<a}]} = f ossa Y) dt 


Il 
sir 
M4: 
is 
me 
e 
=~ 
>| | 
3 
V—— d 
e 
+ 


We consider the change of variable u = h^! (t — z;): 


E a= [’ (z; - hu) K (u) du 


z—cri Ln; 


= Re ik (u) dut f * ull (u) du 


— oo 


We deduce that: 


UX-T1(Xxaz)] = 2» z;K (u) du + 


2. We have: 


II 

| = 
B8 

N 
Lem 

8 

Ex 
NS 


3. Since we have: 


we deduce that: 


=fr 


fi aarda 


oo 


It follows that: 


—> Owhenh0 


4. Finally, we obtain the result: 


L[X-d1(X <r} x Yar eJ 


iml 


41 


42 Handbook of Financial Risk Management 


We conclude that: 


ESq(w;h) = [L (w) - 1 (D (w) > VaRa (w;h)}] 


ier ? [IE (w) - 1 {Il (w) € — VaRa (w; h)}] 


because we have IL(w) = —L (w). 
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Credit Risk 


3.4.1 Single and multi-name credit default swaps 
1. We have F (t) = 1—e^, S (t) = e^ and f (t) = e^". We know that S (T) ~ U1). 
Indeed, we have: 
Pr{U <u} = Pr(S(rT)xu) 
= Pr{r 287") } 
= 8 (S! (u)) 
EM 
It follows that r = S^! (U) with U ~ Ui. Let u be a uniform random variate. 
Simulating 7 is then equivalent to transform u into t: 


1 
t=—-lnu 


À 
2. (a) The premium leg is paid quarterly. The coupon payment is then equal to: 
PL (tm) = Atmxsx N 
Z ; x 150 x 1074 x 109 
= $3750 


In case of default, the default leg paid by protection seller is equal to: 


DL = (1-R)xN 
= (1-— 4096) x 10° 
$600 000 


'The corresponding cash flow chart is given in Figure 3.1. If the reference entity 
does not default, the P&L of the protection seller is the sum of premium interests: 


Ireller = 8 x 3750 = $30000 


If the reference entity defaults in one year and two months, the P&L of the 
protection buyer is!: 


p" = (1-R)xN- M AtuxsxN 


iT 
2 
= (1-— 4096) x 109 — c i) x 3750 


—  $582500 


1We include the accrued premium. 
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The protection buyer receives $600 000 


if the defaults occurs before the maturity 


Qi Q2 Q3 Q4 Q5 Q6 


The protection buyer pays $3 750 


each quarter if the defaults does not occur J 


M 


FIGURE 3.1: Cash flow chart of the CDS contract 


(b) Using the credit triangle relationship, we have: 


$c (1—R) xA 
We deduce that?: 
PD >~ A 

" S 
~ 1-R 
|. 150x104 
B 1 — 4096 
= 2.50% 


(c) We denote by s’ the new CDS spread. The default probability becomes: 
s! 

1-R 

450 x 1074 

1-409. 

= 7.50% 


PD = 


The protection buyer is short credit and benefits from the increase of the default 
probability. His mark-to-market is therefore equal to: 
ever = Nx (s’—s) x RPVo 
109 x (450 — 150) x 1074 x 1.189 
$35 671 


?We recall that the one-year default probability is approximately equal to A: 
PD = 1-S(1) 
= 1—e^ 
1—(1— A) 
AÀ 


R R 
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The offsetting mechanism is then the following: the protection buyer B transfers 
the agreement to C, who becomes the new protection buyer; C continues to 
pay a premium of 150 bps to the protection seller A; in return, C pays a cash 
adjustment of $35 671 to B. 


For a given date t, the credit curve is the relationship between the maturity T 
and the spread 5; (T). The credit curve of the reference entity #1 is almost flat. 
For the entity #2, the spread is very high in the short-term, meaning that there is 
a significative probability that the entity defaults. However, if the entity survive, 
the market anticipates that it will improve its financial position in the long-run. 
This explains that the credit curve #2 is decreasing. For reference entity #3, we 
obtain opposite conclusions. The company is actually very strong, but there are 
some uncertainties in the future?. The credit curve is then increasing. 


If we consider a standard recovery rate (4096), the implied default probability is 
2.50% for #1, 10% for #2 and 1.33% for #3. We can consider a short credit posi- 
tion in #2. In this case, we sell the 5Y protection on #2 because the model tells 
us that the market default probability is over-estimated. In place of this direc- 
tional bet, we could consider a relative value strategy: selling the 5Y protection 
on #2 and buying the 5Y protection on #3. 


Let Tren be the kt” default among the basket. FtD, StD and LtD are three CDS 
products, whose credit event is related to the default times Ti:n, To.4 and Tn:n- 


'The default correlation p measures the dependence between two default times 
7, and Tj. The spread of the FtD (resp. LtD) is a decreasing (resp. increasing) 
function with respect to p. 


To fully hedge the credit portfolio of the 3 entities, we can buy the 3 CDS. 
Another solution is to buy the FtD plus the StD and the LtD (or the third-to- 
default). Because these two hedging strategies are equivalent, we deduce that: 


spp ES a 


Es sods = gFtD 4 StD 4 çLtD 

We notice that the default correlation does not affect the value of the CDS 
basket, but only the price distribution between FtD, StD and LtD. We obtain a 
similar result for CDO?. In the case of the subprime crisis, all the CDO tranches 
have suffered, meaning that the price of the underlying basket has dropped. The 
reasons were the underestimation of default probabilities. 


3.4.2 Risk contribution in the Basel II model 


1. 


(a) 


The portfolio loss L follows a Gaussian probability distribution: 
L(w)~N (o. VwT Ew) 


We deduce that: 
VaR, (w) = $^! (a) Vw! Sw 


3An example is a company whose has a monopoly because of a strong technology, but faces a hard 
competition because technology is evolving fast in its domain (e.g. Blackberry at the end of 2000s). 
4The junior, mezzanine and senior tranches can be viewed as FtD, StD and LtD. 
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(b) We have: 
ð VaRa (w) | 8 = T 1 
Ju ou (è (a) (w' £w) ) 
1 —i 
= 6 (o) 5 (w'Xw) ? (2Xw) 
“uw 
= 6 (g)——— 
( ) Tat Se 
The marginal value-at-risk of the it? credit is then: 
9 VaR, (w) E (Xw), 
MR; = ———__ = © a) ——— 
ð wi (o) vw Tw 
th 


The risk contribution of the i 
marginal risk: 


credit is the product of the exposure by the 


= Wi X (Xw), 
= © 1 a i 
(o) Val da 


c) By construction, the random vector (e, L (w)) is Gaussian with: 
y 


Gm)" (C0) Cate ms )) 


We deduce that the conditional distribution function of € given that L(w) = £ 
is Gaussian and we have: 


je|L(w)-4-20-4Xw(w'Xw) (£-0) 


We finally obtain: 


Zw (w' Sw) $^! (a) Vw! Sw 
Mw 
va T Sw 
ð VaRa (w) 
Ow 


[e| L (w) = F^! (o)] 


II 


$^ (a) 


The marginal VaR of the i*^ credit is then equal to the conditional mean of the 
individual loss £; given that the portfolio loss is exactly equal to the value-at-risk. 


(a) EAD; is the exposure at default, LGD; is the loss given default, 7; is the default 


time and 7; is the maturity of the credit 7. We have: 


ci = LGD; x1 {7; < T;} 


The exposure at default is not random, which is not the case of the loss given 
default. 


(b) We have to make the following assumptions: 


i. the loss given default LGD; is independent from the default time 7j; 
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ii. the portfolio is infinitely fine-grained meaning that there is no exposure 


concentration: 
EAD; 


Xi EAD; ~ 


iii. the default times depend on a common risk factor X and the relationship is 
monotonic (increasing or decreasing). 


In this case, we have: 


b [ex | L = F^ (a)] = E [LGD;] x E[D; | L = F^ (a)] 


(c) It follows that: 


RC; = wx MR; 
EAD; xE [LGD;] x E [D; | L = F7! (o)] 


'The expression of the value-at-risk is then: 


Y nc 
i=l 


E Y^ EAD, xE[LGD,] x E [D; | L = F^! (a)] 


VaRa (w) 


Il 


(d) i. We have 


ZZ] = E (vox + V1 - pes) (vox + vI = ø5)] 


= p 


p is the constant correlation between assets Z; and Zj. 


ii. We have: 
pi = Pr{n<Ti} 
© (Bi) 
iii. It follows that: 
= Pr [ VpX + J1- p: < B: | X =a} 


LB id 


x-2] 


= 


ste) 


ib 


= 0 


i) 


Pr {es 
su 
= 
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(e) Under the assumptions (H), we know that: 


L = S EAD, xE[LGD,] x p; (X) 


= Y^ EAD, xE|LGD;] x e (He) 


= g(X) 


with g' (x) < 0. We deduce that: 


VaRa (w) =F! (œa) & Príg(X) < VaR4 (w)! — o 
& Pr{X >g (VaRa(w))} =a 
e Pr{X <g7'(VaRa(w))} 21- o 
& g |(VaR4(w)) = 9 ! (1— o) 
€ VaR4(w) 2 g (6^! (1—)) 
It follows that: 
VaRa (w) = g(®7'(1—a)) 
= ) EAD; xE[LGDj] x p; (^! (1— o)) 
t=1 
The risk contribution RC; of the ith credit is then: 
RC; = EAD; xE[LGDj] x p; (#71 (1— o)) 
$-! (pj) — /pb-! (1— 
= EAD, xE [LGD;] «e( Pree ow 2) 
vl-p 
-t " -t 
= EAD; xE [LGD;] x 6 (mil PR NG 
vip 


3. (a) We note Q the event X < g^! (VaRa (w)) or equivalently X < $^! (1— a). We 
have: 


ESa (w) = E[L|L 7 VaR, (w)] 
= E[L|g(X) > VaRa (w)] 
= TAE (VaRa (w))] 


— E y BAD: L[LGD;] x p; (X) | Q 
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(b) It follows that: 
i19] = K — i) = 2 el PEN a 


vi-p 
7 $-!(1—a) a 1( ee =P 
= oer + ee) 
mI ae da 
9$ (6-1 (1— a)) 
— (27 (17 0),97! (p): y?) 
l-a 
_ € - oi vp) 
l-a 


where C is the Gaussian copula. We deduce that: 


C(1—- i; 
RC; = EAD; xE[LGD,] x (1 = api; VP) 


l-a 
(c) If p = 0, we have: 
$-! (p; 9-1 (a s 
»( pe ( ? Es d (o> (pi) 
— Pi 
and: 

C(I-ep:yp _  (L=a)pi 

l-a l-a 


= Pi 


The risk contribution is the same for the value-at-risk and the expected shortfall: 


= E[L] 


It corresponds to the expected loss of the credit. If p = 1 and a > 50%, we have: 


® (= poe 2) : e (= Chas ey 


If p= 1 and o is high (a > 1 — sup; pi), we have: 


C(1- opii vP) _ min(l-oip) 


l-a l-a 
= 1 


In this case, the risk contribution is the same for the value-at-risk and the ex- 
pected shortfall: 


However, it does not depend on the unconditional probability of default p;. 


. Pillar 2 concerns the non-compliance of assumptions (71). In particular, we have to 
understand the impact on the credit risk measure if the portfolio is not infinitely 
fine-grained or if asset correlations are not constant. 
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3.4.3 Calibration of the piecewise exponential model 


1. We have: 
S(t)=Pr{r>t}=1-F(t) 


and: 


f(t) =O F (t) = -àS(t) 


2. The function A (t) is the instantaneous default rate: 


1 
= i — <7r<t+ = 
Aà (t) um APrits7 st A|r2t) 

_ 1Pr(tE TELA) 

= lim 
A>0+ A Pr {zr >t} 

2 1 : | Prta T Rd] 
^  Prf{r >t} asot A 
_ fl 

(t) 

In the case of the exponential model, we obtain: 
Ae 
A(t) = oe em À 


3. Since T ~ E (A), it follows that: 


(1—R)x]f, ef (t) dt 
fo e-"tS (t) dt 

(1- R) x fy ete! dt 
Je e-rte-^ dt 

= Ax(1-R) 


s(T) = 


4. (a) We define the survival function as follows: 


e ^t ift «3 
S(t) — 4 e-3--3) if3<t<5 
e73à1—2à2—A3(t-5) ift>5 


We deduce that: 


Aet ift<3 
f) 2 4. Agel) if3<t<5 
Age 341-2A2—Aslt-5) if t > 5 


We verify that the hazard rate is a piecewise constant function: 
ài ift<3 


———-— Ag if3<t<5 
A3 ift>5 
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(b) Let (t1,15,t3) be the knots of the piecewise exponential model?. We note that 
S (t) = S (t5, ,) ev 675-9) and f (t) = AmS (t, ,) e? 07). When T € 


[c D ONE it follows that: 
des x fo ai dt 
7 eS (t) d 
a (t) det f a F (0 dt) 


un 'e-"tS (t) dt-- f e-"'S (t) at) 


We introduce the following notation with T € th: 


T 
LOL T = j e "'S (t) dt 


P E XS. 
Sit geese Tee 
— en 
e (r + àm) 
= S(t a er Tem (T7 i) 


We obtain the following cases: 
i. If T < 3, we have: 


x fo e"! f (t) dt 
fo e-"'S (t) dt 
(1 — R)xAU x Z(0,T) 
I(0,T) 
= (1—-R)xA 


s(T) = 


ii. If 3 <T € 5, we have: 
(L-R) x (fo ef (E) dt + Ja e" f (5 dt) 
(I e-"tS (t) dt + f. e-"tS (t) dt) 


s(T) = 


= AZ (0,3) + A2 (3, T) 
= U= R] X- T03) tIGT) 
iii. If T > 5, we have: 
s(T) =(1—R) 1Z (0,3) + A2Z (3,5) + As (5, T) 


T (0,3) + Z (3,5) + Z(5,T) 


5We use the convention tò = 0: 
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(c) The parameters (A1, A», A3) satisfy the following set of equations: 


$(3)=(1-R) x Ay 
s(5) = (1- R) x S95 ER (3.1) 
— A1Z(0,3)+A2Z (3,5) 3-A3 (5,7) 
$(7)=(1-R) : T(03YFIG 5 EIG.T) 


From the first equation, we estimate Ma: 


« _ s3) 
AUC ATO) 


We can now solve numerically the second equation and we obtain PT Finally, 
we solve the nonlinear third equation to obtain Aa. This iterative approach of 
calibration is known as the bootstrapping method. 


(d) When r is equal to zero and Àm is small, the function Z (t, ,, T) becomes: 


T (th T) 


m-—1» 


I 

un 
— 
~ 
3 * 
5 

wa 
Pd 


ec Tt 


m-—1 


We introduce the following notation: 


_ S(T) 
GE ER 
Using Equation (3.1), we deduce that: 
AL = A(3) 
s T (0,3) + Z (3,5) ES Sm 
& - (Ae m - s 
5A (5) — 3A (3) 
2 
& c TIQG3-4(,54Hz(57) MZ (0,3) + Ao (3,5) 
^s ce ( 1 (5,7) jam T (5,7) 
T(7) — 5A (5) 
2 
We notice that: 
s(3) = ü-X)xÀ 
s(5) = a-m x (Ae) 


s(7) = a-m x (ed 


The spread is then a weighted average of the different hazard rates, whose weights 
are proportional to the interval time between two knots. 


Credit Risk 53 


(e) Using a numerical solver, we obtain Ai = 166.7 bps, À» = 401.2 bps and À4 = 
322.4 bps®. 

Since S (T) ~ Ujo,1), the simulated default time t is S^! (u) where u is a uniform 
random number. If u > S (3), we havee ^! = u or t = —M lnu. If S(5) < u < 
S (3), it follows that S (3) e? *-9 = u or t= 3 + A5! (In S(3) — In u). Finally, 
we obtain t = 5 + A3! (In S (5) — In u) if u < S (5). Using the previous numerical 
values, we find that S (3) = 0.951 and S (5) = 0.878. The simulated default times 
are then: 


(f 


— 


2.449 for u — 0.96 
46.54 for u — 0.23 
4.380 for u — 0.90 
7.881 for u = 0.80 


3.4.4 Modeling loss given default 
1. The loss given default is equal to: 
LGD=1-R-+c 


where c is the recovery (or litigation) cost. Consider for example a $200 credit and 
suppose that the borrower defaults. If we recover $140 and the litigation cost is $20, 
we obtain R = 70% and LGD = 40%, but not LGD = 30%. 


2. The amounts outstanding of credit is: 


EAD = 250000 x 50000 
$12.5 bn 


The annual loss after recovery is equal to: 
L = EADx(1—®) x PD+C 
= 43.75 + 12.5 
$56.25 mn 


where C is the litigation cost. We deduce that: 


L 
EAD x PD 
54 
12.5 x 10? x 196 
— 4596 


LGD - 


This figure is larger than 3596, which is the loss given default without taking into 
account the recovery cost. 


3. (a) The Beta distribution allows to obtain all the forms of LGD (bell curve, inverted- 
U shaped curve, etc.). The uniform distribution corresponds to the case a = 1 
and 8 = 1. Indeed, we have: 


f(x) = 


7 81f we consider the approximated formulas, the solutions are Îi = 166.7 bps, deo = 375.0 bps and 
A3 = 308.3 bps. 
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(b) We have: 


£(a, B) 


I 


2 mf (a) 
= —nIn B (a, 8) + (a — 1) /Inzi + (8-1) In (1 ~ a) 


The first-order conditions are: 


d£(a,8) ^ aBa b) Gy. 
da Bas) EXE 


and: 
0 £ (o; B) OgB (a, b 


= n ) Sm — dd) = 
9B — " Bas) x. (1— 2;) 20 


Lom 
e 
— 


Let ur ap and orap be the mean and standard deviation of the LGD parameter. 
The method of moments consists in estimating o and f such that: 


Q 
a 4 B — HLGD 

and: 

ap z 

(a+ a++) P 
We have: 
B= m — Lap) 
HLGD 

and: 


It follows that: 


(a+)? = (a+ a ten) 


HLGD 
Lap 
and: " 
1— dm 
aß = x (o ! ( HGD) | 1) OLap = a2! HLap) 
HLGD HLGD HLGD 


( (ls Lun = (1 — tied) HLGD 
a | 14 = 5 1 
OLGD 


2 

x I 

COMM Prai ( 2 pran) HLGD (3.2) 
LGD 

4 pgp (1 = pep)” 

um = (1 — urap) (3.3) 


2 
?LGD 
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4. (a) The mean of the loss given default is equal to: 


100 x 0% + 100 x 25% + 600 x 50% +... 


HLGD = 1000 
= 50% 
The expression of the expected loss is: 
100 
EL = M! EAD; xE [LGD;] x PD; 
i=1 


where PD; is the default probability of credit i. We finally obtain: 


100 
EL = DD 10000 x 50% x 1% 
i=l 


$5 000 


(b) We have irap = 50% and: 


| a 
OLGD 


1000 
7 j| uU 

B 10 

_ [0.625 

~ V io. 

= 25% 


Using Equations (3.2) and (3.3), we deduce that: 


0.5? x (1 — 0.5) 
A = 5=1. 
OMM 0.252 0.5 5 
n 0.5 x (1 — 0.5)? 
Bum 0252 (1 — 0.5) 5 


(c) The previous portfolio is homogeneous and infinitely fine-grained. In this case, 
we know that the unexpected loss depends on the mean of the loss given default 
and not on the entire probability distribution. Because the expected value of 
the calibrated Beta distribution is 50%, there is no difference with the uniform 
distribution, which has also a mean equal to 50%. This result holds for the Basel 
model with one factor, and remains true when they are more factors. 


3.4.5 Modeling default times with a Markov chain 
1. We have P (4) = P (2) P (2) and P (6) = P (4) P (2). 
2. In a piecewise exponential model, the survival function has the following expression: 


)e?w(-) — atte] uut] 


S (t) = S (15.4 mtn 
We deduce that: 
| InS [oa — In S (tz) 


tin — tm-1 


Am 
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with S (t$) = S(0) = 1. Here, the knots of the piecewise function are tf = 2, t5 = 4 
and t3 = 6. If we consider the risk class A, we deduce that: 


In1 — In (1 — 196) 


At = 2-0 = 50.3 bps 
S In (1— m E — 2.4996) — 15.8 bps 
às c In (1— —— (1 — 4.296%) — 93.5 bps 
We finally obtain the following results: 
Rating A B C 


à 50.8 256.5 1115.7 
Àa 75.8 275.9 856.9 
Àa 93.5 277.8 650.2 


. Let P (t) be the transition matrix between 0 and t. The Markov generator of P (t) is 


the matrix A = (A;,;) defined by: 
P (t) = exp (tA) 
where e™ is the matrix exponential of the matrix M. We deduce that: 


A= nP (t) 
t 


In this example, the direct estimator is given by: 


In P (2) 


A= 7 


We verify that A is a Markov generator because NN Mj = 0 and ÀA;; > 0 when 


iz j. 


. For the piecewise exponential model, we proceed as in Question 2 by adding the knots 


t5, = 2m with m > 4. In this case, we have: 
A, (t) = Am if t € ]2m — 2, 2m] 


with: 

. mS (2m — 2) — In S; (2m) 

7 2 

and S; (2m) = 1 — P; 4 (2m). For the Markov generator, we have: 


Àm 


S;(t) = 1-ejP(t)e, 
= T= e] e'^e, 
We deduce that: 
—0:S; (t) 
A(t) = 
(t) S, (f 
e] Acthe, 


1 — ej ete, 
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In the long-run, the markov chain is stationary. This means that the default probability 
of the different risk classes is the same when t tends to oo and we have: 


AA (co) = AB (co) = Ac (co) = 147.6 bps 
In the short-run, the hazard rate are ranked with respect to the risk class: 
AA (0) < ÀB (0) < Ac (0) 


We deduce that the function 4 (t) is increasing whereas the function Ac (t) is de- 
creasing. For the rating B, the behavior of the hazard function is more complex. It 
first increases like A4 (t) and reaches a maximum at t = 4.2, because the transition 
probability to risk classes C and D is very high. Then, it decreases because of the 
stationarity property. 


3.4.6 Continuous-time modeling of default risk 


1. The Chapman-Kolmogorov equation is: 


We deduce that: 


62.60% 13.14% 5.53% 18.73% 
38.42% 20.74% 6.81% 34.03% 
21.90% 12.29% 4.35% 61.46% 

0.00% 0.00% 0.00% 100.00% 


2. (a) The eigendecomposition of P is equal to P = V DV-1, meaning that: 


PV=VD 
We deduce that: 
P(2)V = PVD 
= VDD 
VD? 
By recursion, we obtain: 
P(n) V = VD” 


We can then calculate P (n) as follows: 
P (n) = VD”V ~! 


The eigendecomposition of P (n) is similar to the eigendecomposition of P: the 
eigenvectors are the same, only the eigenvalues are different. 
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(b) We have: 


0.4670 —0.2808 


0.3561 
0.2065 
0.0000 


and 
0.9717 


0.0000 
0.0000 
0.0000 


We deduce that: 


0.7506 
0.0000 
0.0000 
0.0000 


We verify that: 


62.60% 
38.42% 
21.90% 

0.00% 


VDV! 


= P(10) 


3. Let R; (n) be the rating of a firm at time n whose 


have: 


0.8486 
0.5363 
0.0000 


0.0000 
0.8111 
0.0000 
0.0000 


0.0000 
0.1233 
0.0000 
0.0000 


13.14% 
20.74% 
12.29% 

0.00% 


—0.0264 
—0.2313 
0.8609 
0.0000 


0.0000 
0.0000 
0.5571 
0.0000 


0.0000 
0.0000 
0.0029 
0.0000 


1.0000 
1.0000 
1.0000 
1.0000 


0.0000 
0.0000 
0.0000 
1.0000 


0.0000 
0.0000 
0.0000 
1.0000 


5.53% — 18.7396 
6.81% — 34.0396 
4.35% 61.46% 
0.00% 100.00% 


initial rating is the state i. We 


S$,(n = 1- Pr{%; (n) = D) 


LI Í =e- 


= l1 


a 


P (n) e4 


= 1- (PP) m 


In the piecewise exponential model, we recall that the survival function has the fol- 


lowing expression: 


S; (n) = S; (n = 1) EM) 
We deduce that: 
Mi(n) = InS;(n-1)- nS; (n) 
1—ej P^?-le, 
= In 
1— ej Pre, 
We verify that: 
1 
A) = In| ————— 
(1) "(sra 


-h (1— ej P"e) 


because S; (0) = 1. Numerical values of S; (n) and A; (n) are given in Table 3.1. 
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TABLE 3.1: Numerical values of S; (n) and A; (n) 


3 


S4 (n) 
1.0000 
0.7914 
0.7704 
0.7498 
0.7295 
0.7096 
0.6901 
0.6711 
0.6525 
9 0.6344 
10 0.6167 
50 0.1962 
100 0.0468 


ANAT KRWNHFH © 


Sp (n) 
1.0000 
0.6360 
0.6139 
0.5932 
0.5737 
0.5552 
0.5377 
0.5211 
0.5051 
0.4899 
0.4752 
0.1496 
0.0357 


Sc (n) 
1.0000 
0.3708 
0.3575 
0.3451 
0.3335 
0.3226 
0.3123 
0.3026 
0.2933 
0.2843 
0.2758 
0.0868 
0.0207 


àa (n) 


0.2339 
0.0269 
0.0272 
0.0274 
0.0276 
0.0278 
0.0280 
0.0281 
0.0282 
0.0283 
0.0287 
0.0287 


àp (n) 


0.4526 
0.0354 
0.0343 
0.0334 
0.0327 
0.0320 
0.0315 
0.0310 
0.0307 
0.0303 
0.0287 
0.0287 


Ac (n) 


0.9921 
0.0367 
0.0352 
0.0341 
0.0332 
0.0324 
0.0318 
0.0313 
0.0308 
0.0305 
0.0287 
0.0287 


4. Let P (t) be the transition matrix between 0 and t. 
the matrix A = (A;,;) defined by: 


P (t) = exp (tA) 
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The Markov generator of P (t) is 


where e™ is the matrix exponential of the matrix M. We deduce that: 


In particular, we have: 


A-t !InP (t) 


3.2282 
23.5006 
21.6482 

0.0000 


In P (n) 


n 
In P” 


n 
ln P 


2.4851 
9.9915 
—52.3649 
0.0000 


0.7160 


2.1936 


x 107? 


25.3364 
0.0000 


We verify that A is a Markov generator because 4 Mj = 0 and 4; > 0 when 


We obtain: 
—6.4293 
^ 11.3156 — 
um 5.3803 
0.0000 
izj. 
5. We have: 


We remind that: 


We deduce that: 


— [44 


P(t)= 


exp (tA) 


M? M3 
et =f i a qe deus 


2! 3! 


Pio Bag tg 
tA A A AOE acs 
+5 t6 t + 
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The 6-month transition probability matrix is equal to: 


Bg) ea 


96.90% 1.56% 1.11% 0.43% 
5.32% 89.19% 4.17% 1.33% 
2.60% 8.99% 77.20% 11.21% 
0.00% 0.00% 0.00% 100.00% 


6. We have: 


P (t) e4 


ette, 


We know that: 


We deduce that: 
ej Ae'^e, 


1 — e] e'^e, 


Ai (t) 


3.4.7 Derivation of the original Basel granularity adjustment 


1. We deduce that: 


p(z) = E[L;| X 2 x] 
Exp; (1 + wi (x — 1)) 


Il 


and: 


v(x) = o° (L;|X=2) 

=. A 

= E?[LGD,]p; (x) (1 — p; (x)) + o? (LGD;) p; (£) 
?? [LGD;] + o? (LGD;)) p; (x) — E? [LGD,] p? (x) 


| 
— 


If we assume that p? (x) ~ 0, it follows that: 


v(x) = ( 2? [LGD;] + o? (LGD;)) p: (£) 
(E? + e? (LGD;)) p; (1+ w; (x — 1)) 


We have: 


1 
E? +° (LGD;) = E? + 7 Ei (1— Ei) 


We conclude that: 


v(x) = Ej (G+ i5) pi (1+ vi (x — 1)) 
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2. The computation of the derivatives of u (x) gives: 


Op (x) 
= Ejpiwi 
Ox en 
and: 
P(x) |— 
Ox? 
For the variance, we obtain: 
Q v (a) 1 3 
— Ei Ei | piwi 
Ox G+ ) px 
Since we have: " EN 
CEREALES RA 


T (ag) 
and: 
In h (x) 2 —InT (ag) + ag ln 8, + (o4 — 1) ln z — Box 


We deduce that: 


A, lnh (x) = (@-1)_ 1 
T Bg 
The granularity adjustment function is: 
1 02 (x) lO,v(x) 1 Ox In h (x) 
x) = zv(x v(x 
n 2") (8.u(z)? 20,n(x) 2 e) Op (x) 


3 1 
EL zer Ens pipe cm 
2 4 4 Eipivi 


»-c6 5 (ate A 1 
sil g t+ ED | pt wi 1 4 
2 GOLD TREN »( i em 


E EIC (== =) (A +6 1)) +1) 


. In order to maintain the coherency with the IRB formula, we must have: 


$ (= (pot UR 2) = pi (1 ei (x — 1)) 


dl 
This implies that the factor weight w; is equal to: 
1 Fi 


Wi = ru 
(x — 1) pi 


TN [E orria a) " 


Finally, we obtain the following expression for 8 (a): 


s» = te) CC) (eoe) 


i) 
= : (0.25 + 0.75E;) (4 xii 4j 
p) 


where: 


a 


TE (= 1 


where: 
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4. Since we have E[X] = o48, and o (X) = \/ag8,, the parameters of the Gamma 
distribution are a = 0.25 and 6 = 4. Since the confidence level a of the value at 
risk is equal to 99.5%, the quantile of the Gamma distribution G (0.25; 4) is equal to" 
Lo = 12.007243 and the value of A is 3.4393485. We deduce that: 


1 i 
p (ta) = 5 (0.25 + 0.75 E;) (24893485 + 343034852) 


7 


a 


= (0441.25) (0.ro229040 + Lo7ar964 n.) 
We retrieve almost the Basel formula given in BCBS (2001a, §456): 


B (£a) = (0.4-- 1.2 x LGD) (0.70 + L102) 


l 


In order to find exactly the Basel formula, we do not use the approximation p? (x) ~ 0 
for calculating v (x). In this case, we have: 


v(o) = A (Sed) aem n - BP aem o? 


1 
990) — E (14 5) pim- 22pm (1 + (o — 1) 
Ox 4 4 


We deduce that: 


B(r) = 5 (0.25 + 0.75E;) (- (= - x) (m1! (z-1)- 1) + 


MEET DNE: 
Eipi (1+ 2; (x — 1)) (: +50) ( x 23) 
———M——— 


Correction term 


When the expected LGD E; varies from 596 to 9596, the probability of default p; varies 
from 10 bps to 1596 and the asset correlation p varies from 1096 to 3096, the relative 
error between the exact formula and the approximation is lower than 1%. 


5. We deduce that: 


1 EAD* 7 

QA UR Sx xB (®a)\ _ 9 94 x RWAxn 
8% n* 
EAD* 


* 


= —— (06 +18 x E*) (os + 13.75 x z) — 0.04 x RWANR 


EAD* 
= " x GSF —0.04 x RWAnr 


* 


where: 


GSF = (0.6 + 1.8 x E*) (ss 413.75 x z) 


TContrary to the Vasicek model, the conditional probability of default p; (X) is an increasing function 
of X in the CreditRisk+ model. Therefore, we have £a = HT! (a) and not zo = H-! (1 — o). 
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. Following Wilde (2001b) and Gordy (2003), the portfolio loss is equal to: 


L= Y » EAD; x LGD; x D; 


j-1i€C; 
where C; is the j*^ class of risk. The goal is to build an equivalent homogeneous 


portfolio w* such that: 
L* = EAD* x LGD* x D* 


First, it is obvious to impose that: 
nc 
EAD* = 3 V EAD; 
jHliec; 
Wilde and Gordy also propose to equalize the default rates weighted by exposures: 


ne 


L[EAD* x D'] =E p» 5 EAD; xD; 


j-1i€C; 


'This implies that: 


» 2751 Diec, EAD: xpo; 
Pee EAD* 


nc 
= po SC; X pc; 
j=l 
where pc; is the default probability associated to Class C; and sc; is the corresponding 


Shee BAD: 
Cp Ti 
275512 rec, EAD 


relative exposure: 


We also have: 


L[EAD* x LGD* xD*] = E »» Y EAD; x LGD; x D; 
j-1i€C; 
3 S^ EAD; x EXx pt = S SABE DG 
j-1i€C; j-1i€C; 
= S Y EAD; xE; 
j=l i€C; 


Let Ec, be the average loss given default for Class Cj: 


E Dicc EAD; x Ej 
“  Yiee, EAD; 


We deduce that: 


Y^ Y^ EAD, x qos ES. x Ec, x Y EAD; 
j=l 


j-1i€C; icC; 
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64 
or: 

N P 

E* f. Y PC; x Ee x 2 iiec; EAD; 

= . - 
pent i uo vec, HAD: 
< sc, X p 
c; X pc; x Ep, 


= Mm 
j=l yA $C; X DC, 


We remind that the conditional variance of the portfolio loss is equal to: 


nc 


o? (L| X) - 3 3 EAD? x (E?p; (X) (1 — pi (X)) + 0? (LGD;) p; (X)) 


j-1ieC; 


It follows that the expression of the unconditional variance is: 


o? (L) = c*(E[L| X])+E[o?(L| X)] 


= o? S^ Y^ EAD, Ep: (X) + 
j=l icc; 


Contribution of the systematic risk 


: Y Y EAD? (E?pi (X) (1 — pi (X)) + 0? (LGD;) p; (X)) 


jHliec; 
Contribution of the idiosyncratic risk 
and: 
ne 
o’ (L) = o | X EAD* se, Eepe (X) | + 
j=1 


Contribution of the systematic risk 


Xe (pe; (X) (1 — pe, (X)) - 0? (pe, (X))) »» EAD; E? | + 
j=l iec; 


Contribution of the idiosyncratic default risk 


5 pe, | X EAD? o? (LGD;) 
j=1 


icC; 


Contribution of the idiosyncratic LGD 


For the homogenous portfolio w*, we have: 
EAD*)? 
o? (L*) = o? (EAD* E*p* (X)) + pu x 
n 


(EP (p* à = »*) = e? (p (X))) + 0? (LGD*) p") 


In the CreditRisk+ model, we have: 
pi (X) = pi (1 w: (X — 1)) 
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We have already used the property that E [p; (X)] = pi, which implies that E [X] = 1. 
For the variance, we have: 


o? (pi (X)) = piwo” (X) 
The calibration of the systematic risk implies that the factor weight w* is equal to: 


nc 
rs ae ui pc; we; Ec; $c; 
p*E* 
nc 
ja DC; We; Ec; $C; 
nc 
$a PC; Ec; $C; 


For the idiosyncratic default risk, we have: 


E*'p* (1 — p*) - (p*w*a (X) 
: 2\ Doves, BAD: E 
2 E? (ve, (1 — pe) = (pe; wc;o (X)) ) Sy A Ca 
(ran z, ) 


We use the following equalities: 


(EAD* E ji 7 Yo ec, Hi BAD : 
e; * iecj ? 
j (rap e) 


icc (EAD, E;)* | 2 iec; EAD; ) 
2 nc 

es E; x EAD; ) 2y 2 iecy EAD; 

= Hes? 


where Hc is the Herfindahl defined by the following expression?: 


Zcc (EAD; x E)? 
(Sec EAD; x Ei)” 


Finally, the expression of n* is equal to: 
1 
Yje Ac; He; 86, 


* 
n = 


where: 
E? (rc, (1 m = (pc, 0,0 (X))”) 
(E? (p* (1 — pt) — (prota (X))?) 


We retrieve the expression of n* given by the Basel Committee (BCBS, 2001a, §445). 


ee 


J 


8We do not obtained the same result than Gordy (2003), who finds that: 
2 
DEA EAD; 


2 
Os BAD,) 
J 


He, — 


I 
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However, there is a difference between the analysis of Gordy (2003) and the formula 
Ac, proposed by the Basel Committee. In BCBS (2001a), the calibration of n* uses 
both the idiosyncratic default risk and the idiosyncratic loss given default. In this 
case, we have: 


P. (ve, (1 — pe;) me. (pc, 70,0 (X))”) T pejo? (LGDc;) 


Ac; = 2 2 
(E*)? (p* (1 — p*) = (pte (X))*) + ne? (LGD") 


J 


Using the hypothesis of the Basel Committee — o (X) = 2, we have: 


(pimio (X) = ( 


For X ~ G (0.25; 4), we have already shown that £a = 12.007243. We obtain: 
(pimio (X))? = 0.033014360 x F? 


We remind that: 


1 


Finally, we obtain the expression of the Basel Committee: 


E2 (pe, (1 — pe,) — 0.033 x Fg, + 0.25 x pe, Ec, (1 — Ec;) 


Ac, = 
(E*)? (v (1— p*) — 0.033 x (ey) + 0.25 x p* E* (1 — E*) 


I 


where: 


ad 9 (0e) + Vem" (a) 
Fe, = ® ( UIS Dc; 


and F* = 355 SC; Fe;. 
7. For calculating the granularity adjustment, we proceed in two steps: 


e In the first step, we transform the current portfolio into an equivalent homoge- 
nous portfolio: 


A E 2 ec; EAD; 
Cj MEE n, 
yum Niece; EAD; 
ne 
PDag = sc, x PDe, 
j=l 
Xd $C; x PDe; x LGDe; 


LGDac 


II 


ne 
Viet $C; x PDe; 


where PDe; is the default probability of Class C; and LGDc, is the average loss 
given default of Class C;: 


p» EAD; x LGD; 
2 icc, EAD; 


LGDe, = 
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Then, we calculate 
ne 
Fac = 5 sc; X Pe, 
j=l 


where Fe, is the unit unexpected loss of Class C;: 


— p ( $7 (PDc;) + vee (o) 
Fo, =® | EX PDe, 


'The equivalent number of loans n* is the inverse of the Herfindahl H* index: 


1 1 
H* B pu Ac; x He, x Se, 


n* = 


where: 
2 iec; EAD? 


Hc, = (NET EAD) 


I 


Ac, is calculated as follows: 


LGD£, (Be, — 0.033F2,) + 1 Bc, LGDe, 
Ac, = d 
a= LGDÀs (Bao — 0.033F2q) + 1Bac 


where: 


B, = PD; (1 — PD;) 


e In the second step, we calculate the granularity scale factor: 


PD 
GSF = (0.6 + 1.8 x LGDag) x (ss + 13.75 x m 


AG 
Finally, the granularity adjustment is equal to: 


TNRE x GSF 
QU S ORES UE sed ROBAR 
TU 


* 
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where TNRE is the total non-retail exposure and RWAnp is the total non-retail 


risk-weighted assets. 


3.4.8 Variance of the conditional portfolio loss 


1. D; (X) is a Bernoulli random variable with parameter p; (X). We have 


[Dj (X)] 


pi (X). By definition, the probability distribution of D? (X) is the same than the prob- 
ability distribution of D; (X). It follows that D? (X) is a Bernoulli random variable 
with parameter p; (X). Since D; (X) and D; (X) are independent because the default 


times are conditionally independent in the Basel II model, we obtain: 


[Di (X) D; (X) = E[Di CX] E[D; (X)] 


2. We have: 
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3. We have: 
U[L(X) = XE œ) 
» Do :[LGD;] E [D; (X)] 
z Do :[LGD;] p; (X) 
4. We have: 


n 2 n 
(Y w; LGD; D; eo) = MiwiLGD] D? (X) + X` wiwi LGD; LGD; D; (X) D; (X) 


= v1 izj 
and: 
[A (X) = E (È wi LGD; D; 2) 
= yw : [LGD;] E [D7 (X)] M :[LGD;]E [LGD;] E [D; (X)] E[D; (X)] 
We also have: 
^[b(X) = bs w;E [LGD] p; eo) 


= S wE [LGD;]p»? (X) + X` wiw;E[LGD;] E[LGD;] p; (X) p; (X) 
i—1 ifj 


We deduce that: 


o’ (L(X)) = E[L’(X)] -E'[L(X)] 
= yw [LGD?] [pt eo] - owt ?? [LGD;] p? (X) + 
Yes : [LGD;] E [LGD;] Di) [Dj (X)] - 
Yun :[LGD;] E [LGD;] p; (X) p; (X) 
= Yowk[Lop]] E [D? 00] - Y wR? Lap] pe (X) + 
S ud L[LGD, sides cea (X), D; (X)) 
izj 


= 2 w? (E [LGD}] :[D2 (X)] - E? [LGD;] p? (X)) 
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because cov (D; (X) , D; (X)) = 0. It follows that: 


0? (L(X)) = Y w? (E [LGD}] p: (X) - 


2? [LGD;] p? (X)) 


If we note E [LGD7] = o? (LGD;) + E? [LGD;], we obtain: 


E [LGD?] pi (X) = c? (LGD;)p; (X) + 


o? (L(X)) = wi (c? (LGD,) p; (X) + E? [LGD] p; (X) (1 — pi (X))) 


Another expression is: 


c? (L(X)) = 9 ` w; (EID: (X)] e? (LGD;) + 


?? [LGD] e? (D; (X))) 


because E [D; (X)] = p; (X) and o? (D; (X)) = pi (X) (1 — pi (X)). 
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Chapter 4 


Counterparty Credit Risk and Collateral 
Risk 


4.4.1 Impact of netting agreements in counterparty credit risk 


1. (a) Let MtM 4 (C) and MTMg (C) be the MtM values of Bank A and Bank B for 
the contract C. We must theoretically verify that: 


MtMa+e(C) = MTMA(C)- MTMmsg (C) 
= 0 (4.1) 
In the case of listed products, the previous relationship is verified. In the case of 
OTC products, there are no market prices, forcing the bank to use pricing models 
for the valuation. The MTM value is then a mark-to-model price. Because the 


two banks do not use the same model with the same parameters, we note a 
discrepancy between the two mark-to-market prices: 


MTMA (C) + MTMg (C) z0 


For instance, we obtain: 


MTMa,2(C:) = 10-11=-1 
MTMaip (C2) = -5+6=1 
MTMaia (C3) = 6-3=3 

MTMa,5 (C4) = 17-12=5 
MTMass (Cs) = -5+9=4 
MTMais (Ce) = —-5+5=0 
MTMa+e (C7) = 14+1=2 


Only the contract Cg satisfies the relationship (4.1). 
(b) We have: 


7 
EAD = V ` max (MTM (Cj) ,0) 
i=l 


We deduce that: 


EAD, = 10+6+17+1=34 
EADg = 6+9+5+1=21 


(c) If there is a global netting agreement, the exposure at default becomes: 


7 
EAD = max (>: MTM (C;) D 


i=l 
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Using the numerical values, we obtain: 


EAD, = max(10—546417—5—54+1,0) 
= max(19,0) 
19 
and: 
EADg = max(-11--6—3— 12-- 9-E5-- 1,0) 
= max(-5,0) 
= 0 


QM 
= 


If the netting agreement only concerns equity contracts, we have: 


3 7 
EAD = max (>: MTM (C;) D + S max (MTM (C;) ,0) 


i=1 i—4 
It follows that: 


EAD, = max(10—5+6,0)+17+1= 29 
EADg = max(—114+6-—3,0)+94+5+1=15 


N 
— 
© 
Sab 


The potential future exposure e; (t) is defined as follows: 
€1 (t) — max (xı +o Wı (t) ,0) 
We deduce that: 


je] = [v ines #:0)/ (ae 
z [ twee 


0 


where f(x) is the density function of MtM; (t). As we have MtM,(t) ~ 
N (21, 07t), we deduce that: 


UE a 


With the change of variable y = o, 't~!/? (x — x1), we obtain: 


m atav ox ( j^) dy 
=z] / 2T 2 


c1 vt 


= nj. ó (y) M yo (y) dy 
= ns ( 2) envi - ow], 


ei vt 
- cei) en) 
because $ (—2) = ¢ (x) and & (-2) = 1— (zx). 


: [e (0] 


II 
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(b) When there is no netting agreement, we have: 
€ (t) = €1 (t) + ez (t) 


We deduce that: 


2[e(t)) = Elfei (t)] + E [ez (t)] 
a (a) sae) 
(n) n) 


In the case of a netting agreement, the potential future exposure becomes: 


Il 


Lom 
e 
WN 


e(t) = max (MtM;, (t) + MtM» (t),0) 
max (MtMi+2 (t) ,0) 
= max (zi- c3 4- 01WA (t) + 03W» (t) ,0) 


Il 


We deduce that: 
MtMi42 (t) ~ N (a1 + 22, (of + 03 + 2p2102) t) 


Using results of Question 2(a), we finally obtain: 


ile(t)] = (zı +22) | 


£1 c9 4 
WAC + o2 + 2p0102)t 


"ES +03 + 2p0103) td ( 


£1 c9 
Vot + o2 + 2p0103)t 


(d) We have represented the expected exposure E [e (t)] in Figure 4.1 when zı = 
z2 = 0 and cı = oo. We note that it is an increasing function of the time t 
and the volatility c. We also observe that the netting agreement may have a big 
impact, especially when the correlation is low or negative. 


4.4.2 Calculation of the effective expected positive exposure 


1. We have e(t) = max (MTM (t) ,0) where MTM (t) is the mark-to-market price of the 
OTC contract at the future date t. We note Fo, the cumulative distribution function 
of the random variable e (t). The peak exposure is the quantile o of F(o 4): 


PE, (t) = Foy (a) 
The maximum peak exposure is the maximum value of PE, (t): 


MPE. (0; t) = sup PE, (s) 


'The expected exposure is the average of the potential future exposure: 


EE (t) = E [e (t)] = j 2dF oy (2) 
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One OTC contract Two OTC contracts 
0.4 p = 707 


t (in years) t (in years) 


Two OTC contracts Two OTC contracts 
p = -507 t = 5 years 


t (in years) p (in 7) 


FIGURE 4.1: Expected exposure E [e (t)] when there is a netting agreement 


We define the expected positive exposure as the weighted average over time of the 
expected exposure for a given holding period (0, t]: 


t 
EPE (0; t) = J EE (s) ds 
0 


The effective expected exposure is the maximum expected exposure which occurs 
before the date t: 


EEE(t) = supEE(s) 
s<t 


= max (EEE (t`) , EE(t)) 


The effective expected positive exposure is the weighted average of effective expected 
exposure for a given time period (0, t]: 


t 
EEPE (0; t) = J EEE (s) ds 
0 


2. We have: 


Fjo, (£) Pr {e(t) € a} 


= Pr {ovix < a} 


Il 
F 
ym; 
be 
IA 

a 
2|s 
A 
—— 
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with x € (0, avt]. We deduce that: 


PE, (0;t) = ac vt 
MPE. (0; T) = ao YT 


evt 1 t 
BE() = f pou pose 
0 ovt 2 
1 [É ais ovt 
E(0;#) = Ż ds = —— 
EPE (0; t) a ae 
ovt 
EEE (t) = —— 
tar t 
EEPE (0; t) = 3 ag OVE 
3. We have: 
Fonte) = etes) 


with x € [0,00]. We deduce that: 


EPE (0;t) = (cx (52%) 1) / (30) 


1 
EEE (t) = exp (3) 


EEPE (0; t) = (e (30) 1) / (50°) 


£ 
~ o (8 — 2TH + 4T) 
with x € [0,0 (t? — Tt? + $1?t)]. We deduce that: 


4. We have: 
F 0,4] (x) 


4 
PE, (0) = ac G - ir? + i73) 


MPE. (0; t) = 1 (t < t^) x PFE, (0; t) + 1 (t > t*} x PFE, (0;t*) 


1 4 
EE (t) = 57 (« ua + 51°) 
9t? — 28Tt? + 24T?t 
72 


EEE (f) = 1 (t < t*) x EE (t) + 1 {t > t*] x EE (t*) 


EPE (0; t) -«( 


T5 
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1 t 
EEPE (0; t) = 3 EEE (s) ds 
0 


with: 


ra (228) 2 


This question is more difficult than the previous ones, because e(t) is not a mono- 
tonically increasing function. It is increasing when t < tł and then decreasing!. This 
explains that MPE, (0;t) and EEE (t) depends on the parameter t*. 


5. The cumulative distribution function of X is: 


Il 


F (x) Pr{X € x] 


T u? 
= d 
f a+1 ü 


get! 


We deduce that: 
Fog (z) = Pr(e(t) £a) 
Pr {ovix < a} 


I 


ll 

"y 

d 
——s 

v 

IA 
q 
als 
D 
—— 


and: 


It follows that: 


and: 
MPE, (0; T) = al (+o /T 


'The expected exposure is: 


avt a 
E (a+1)x _ (a+1)ovt 
EE (t) =| ov de = ET 


We deduce that: 


BEE (o = C+ Devt 
and: ( )o Vs ( )avt 
jog t1 9/5 2 (a - 1)o vt 
gEPE(x0 - + f a+2 MET EAT 


lIn fact, there is a second root: 


9 


We observe that e(t) can take a negative value when t is in the neighborhood of this solution. We ignore 
this problem to calculate the different measures. 


tj = Gor 
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6. In Figures 4.2 and 4.3, we have reported the functions EE (t), EPE (0; t), EEE (t) 
and EEPE(0;t) for the two exposures given in Questions 3 and 5. We notice that 
the second exposure has the profile of an amortizing swap where the first exposure is 
more like an option profile. 


EE(t) EPE(0;t) 
15 5 
4 
10 
3 
2 
5 
1 
0 0 
0 5 10 15 20 0 5 10 15 20 
t (in years) t (in years) 
EEE(0) EEPE(0;t) 
15 5 
4 
10 
3 
2 
5 
1 
0 0 
0 5 10 15 20 0 5 10 15 20 
t (in years) t (in years) 


FIGURE 4.2: Credit exposure when e (t) = exp (oVtN (0, 1)) 


4.4.3 Calculation of the required capital for counterparty credit risk 


1. We have: 


Fo (£) = Pr{e(t) E xj 
— Pr {Noviv < a} 


= {Us aen) 
m (xax) 


with x € [0, Novi]. We deduce that: 


PE, (t) = Fo] (a) 
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EE(t) EPE(0;t) 
8 8 
6 6 
4 4 
2 2 
0 0 
0 1 2 8 4 5 0 1 9 3 4 5 
t (in years) t (in years) 
EEE(t) EEPE(0;t) 
8 8 


0 1 2 3 4 5 "0 1 2 3 
t (in years) t (in years) 


FIGURE 4.3: Credit exposure when e(t) = o (t — T? + $T?t) Up 


For the expected exposure, we obtain: 


EE (t) : [e (t)] 


E [nde 
oo Nova” 
y yet 

~ (Novi)’ I 


y 
= — Novi 
y+1 


0 


We deduce that: 
EEE (t) = —— Novi 
y+1 


and: 


1 


t 
EEPE (0; t) f T EEE (s) ds 
0 


Il 
| 


I ^y 
= = ——N d 
ae se 


t 
= UE Ng! EA 
y+1 t |3 0 


2y 
= 5. Novt 
3(44- 1) evt 


2. (a) When the bank uses an internal model, the regulatory exposure at default is: 


EAD — a x EEPE (0; 1) 
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Using the standard value a = 1.4, we obtain: 


4 
EAD = 14x g X 3 x 10° x 0.20 
$373 333 


(b) While the bank uses the FIRB approach, the required capital is: 


K = EAD xE [LGD] x (o (= (ED) uu a) PD) 


When p is equal to 20%, we have: 


-1 (PD) + /p®-* (99.9%) |^  —2.33-4- v0.20 x 3.09 
/1 =p a vI — 0.20 
= —1.06 


By using the approximations —1.06 ~ 1 and  (—1) ~ 0.16, we obtain: 


K = 373333 x 0.70 x (0.16 — 0.01) 
$39 200 


'The required capital of this OTC product for counterparty credit risk is then 
equal to $39 200. 


4.4.4 Calculation of CVA and DVA measures 


1. The positive exposure e* (t) is the maximum between zero and the mark-to-market 
value: 


et (t) = max(0,MtM (t)) 
max (o. NovtX) 


We have: 
Fog (£) = Prfe*@) <a} 
= Pr { max (0, Nevtx) <x} 


We notice that: 


0 if X <0 
xps (0, Nevix) = { NovVtX otherwise 


By assuming that x € [0, Novi], we deduce that: 


Fo. (£) = Pr(et(t) Ez, X <0}+Pr {et (t) <2,X >0} 
= Pr (0 € z, X <0}+Pr{Novix < z, X >0} 


Pr {Noviw < a} 


uu a 


N|O ple 
NIe r5] e 
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where U is the standard uniform random variable. We finally obtain the following 
expression: 
x 


2NovVt 
If € 0 or x> Noy, it is easy to show that Fo, (£) = 0 and Fo (x) = 1. 


1 
Fjo, (£) = at 


2. The expected positive exposure EpE (t) is defined as follows: 
EpE (t) = E [e* (t) 


Using the expression of Fjo (2), it follows that the density function of e* (t) is equal 
to: 


OF o.4 (2 
fo. (@) = B 


1 
2NovVt 


We deduce that: 


Not 
EpE(t) = ri z fo, (x) dx 


Novt 
x 
0 2NovVt 
al, 
4Novt 


Novt 
4 


0 


3. By definition, we have: 
T 
CVA = (1— Rpg) x n — Bo (t) EpE (t) dSg (t) 
0 


4. The interest rates are equal to zero meaning that Bo (t) = 1. Moreover, we have 
Sp (t) = e~>8". We deduce that: 


ae. 
CVA = (1- Re) x | Nov eet dt 
0 


_ NAB Rao a 
0 


4 


'The definition of the incomplete gamma function is: 


visa) = f pte dt 
0 


By considering the change of variable y = Apt, we obtain: 


T ABT 

vte™òBtdt = ni , |e Y 

f 0 AB AB 
1 A 


BT y 
E y^ e dy 
un i 
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It follows that: 
N(1—Rep) oy (2, ABT) 


4/XB 


5. The CVA is proportional to the notional N of the OTC contract, the loss given 
default (1 — R g) of the counterparty and the volatility o of the underlying asset. It is 
an increasing function of the maturity T because we have y (3, À pT) y (3, À BT) 
when T5 > Ti. If the maturity is not very large (less than 10 years), the CVA is an 
increasing function of the default intensity Ap. The limit cases are?: 


CVA — 


N(1-R 3 AgT 
in GVA = tig D aloy (GAT) 
ABO OO AB OO AV Ap 


and: " 
NA-R T (3 
meme? e EE 


When the counterparty has a high default intensity, meaning that the default is im- 
minent, the CVA is equal to zero because the mark-to-market value is close to zero. 
When the maturity is large, the CVA is a decreasing function of the intensity Ap. In- 
deed, the probability to observe a large mark-to-market in the future increases when 
the default time is very far from the current date. We have illustrated these proper- 
ties in Figure 4.4 with the following numerical values: N = $1 mn, Rg = 40% and 
a = 30%. 


5.0 


0 \ 
9 \ 
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i \ 
\ 
2.5 \ — [21 
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\ 
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‘N 
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x 
1:5 Sa 
ur 
-— = ~ æ ag 
i ab E m 
Pd iln ee 
WE gy eae e 
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/ 
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0 2 4 6 8 10 12 14 16 18 20 
Ag (in Z) 


FIGURE 4.4: Evolution of the CVA with respect to maturity T' and intensity Ap 


6. We notice that the mark-to-market is perfectly symmetric about 0. We deduce that 


?We have lims—=oo 7 (s, £) =T (s). 
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the expected negative exposure EnE (t) is equal to the expected positive exposure 
EpE (t). It follows that the DVA is equal to: 


N (1— Ra) 07 (3, AAT) 
4VXa 


DVA = 


4.4.5 Approximation of the CVA for an interest rate swap 


1. We have: 
Ae’-B>O0e2a>a2*=mnB-InA 


It follows that: 


oo 1 = 
j [max (Ae* — B,0)] = max (Ae* — B,0) —ó (z Ax) dax 
= Ox Ox 
= A e” l (==) dz 
ae Ox Ox 
B dg (=) dz 
at OX Ox 


By considering the change of variable y = ox! (x — x), we deduce that: 


EpE (t) = A if gtox 4 (y) dy — B / $ (y) dy 
y* y 


* 


where y* = ox! (z* — ux). We have: 


oo 
af e"x*v?X9$(y)dy = Ae”x e- BY FOXY dy 
y* 


5l Box)? ay 


3) 
E 


— - Aetio ux t 03, -InA—- ln B 
= = 


and: 


- Bọ (e) 
ox 


Finally, we obtain: 


EpE() = Aerstieka ( 


BO (= eens 
Ox 


Lx Tt 03, -lnA—- In B 
ox 
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2. The mark-to-market is an approximation of a fixed-float IRS in continuous time by 
assuming that the floating leg is constant, implying that the term structure of the 
float rate is flat (Syrkin and Shirazi, 2015). The first term of the mark-to-market is 
the floating leg, because the cash flows change with the time t, whereas the second 
term is the fixed leg?: 


mem (y=N f 10.09) as—v f $0.7) Bets) ds 
—_—_— M Oe” 


Floating leg Fixed leg 


Since the instantaneous forward rate follows a geometric Brownian motion, we deduce 
that: 
f (0,7) = f (0,7) eo ewe) 


and: 


f(0,T)~ LN (m: (0, T) + (» - 57°) tet) 


We also have: 


oet, T) = EET 


I 
ET 
[Sy] 
| 
pu 
3/3 
J 
4 
LL 
r 


It follows that: 


Il 


MtM (2) NEGT) - r0.) f B(s) ds 
t 
= NfO,T)y(t,T) (Aae z 1) 
The confidence interval of MtM (t) with confidence level o is defined by: 
MtM (t) € [q_ (t;a) . q— (t: @)] 


where: 


qx (t;a) = Nf (0,T) e (t, T) (EPE R Ca) - 1) 
3. For the expected mark-to-market, we have: 
E[MtM(] = Nf(0,T)v (T) (e 37^*g [ewo] -1) 


= Nf(,T) e (tT) (elte 1) 
= Nf(0,T)ov(t,T) (e — 1) 


3 f (0, T) is known at time t = 0. 
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For the expected counterparty exposure, we have: 


e(t) = 2 [max (f (0,7) (c7 $999 — 1) ,0)] 


= E [max ((f (0,7) e08 tewe — 70,7) ,0)] 
= E [max (Ae* — B,0)] 


where A = Nf (0, T) e (t, T) e^-39)*, B = Nf (0,T) e(t, T) and X ~ N (0,074). 
Since In A — In B = (u — $07) t, we obtain: 
o?t -IlnA— In B ln A — ln B 
rien cud 
= Nf(0,T)e(5T) (e (5(t)) -8 (84) - evt) 


EpE(f) — acia ( 


where: 


. We have: 


T 
CVA(t) = (1—-'R)x l — B, (u) EpE (u) dS (u) 


T 
(1—R)x / Ae-(o*9-0 EpE (u) du 
t 


I 


T 
Sx f e- (709-0 EpE (u) du 


t 


where S is the credit spread of the counterparty. 


. Syrkin and Shirazi (2015) propose the following approximations: e’ — 1 ~ pt, 


d UC) 


to EAE) C) 
We have: 
(x) = etp ((4 4 57) 2 -ö ((4 = 57) 2 
= (e#-1)6 (£ + 57) vi) + 
e(l) E- 
~ ut ($) +ovid (2 


Therefore, an approximation of the CVA is: 


T 
CVA (t) zz s x N x f (0, T) «f g(u) du 
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where: 


g(u) =e qr) (ue (A) uag (E) va) 


To calculate this approximation, we use a numerical integration method. Syrkin and 
Shirazi (2015) provide a second approximation that does not require any integration, 
but it seems to be less accurate. 
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FIGURE 4.5: Confidence interval of the mark-to-market 


6. All the computations are done using a Gauss-Legendre quadrature of order 128. 


(a) We have reported the 9096 confidence interval of MtM (t) in Figure 4.5. 


(b) The time profile of EpE (t) and E [MtM (t)] is shown in Figure 4.6. We verify that 
EpE (t) > E[MtM (t)] and we retrieve the bell-shaped curve of IRS counterparty 
exposure. 


(c) In Figure 4.7, we observe that the approximation of the CVA gives good results. 


(d) When we calculate the CVA, we consider a risk-neutral probability distribution 
Q. This implies that u = 0% is a more realistic value than u = 2%. 


4.4.6 Risk contribution of CVA with collateral 
1. Since we have MtM,; (t) = mi (t) + c; (t) Xi, we deduce that: 


MtM(t) = »3 wi (pi (t) + oi (t) Xi) 


i=1 i=1 
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FIGURE 4.6: Comparison of EpE (t) and E [MtM (t)] 
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FIGURE 4.7: Approximation of the CVA 
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Let u(t) = (pa (t),..., Hn (t)) be the mean vector of (MtMi (t),...,MtM, (¢)). It 
follows that the expected value Hw (t) of the portfolio mark-to-market is equal to: 


Holt) = E[MtM (8) 
2) wipi (t) 


= w' p(t) 


We define the volatility ow (t) of the portfolio mark-to-market: 


c2(t) = var(MtM (t)) 


= var (>: wici (t) x) 


j»i 
TL 
= S uo? (t) + 5 wiw;o: (t) (t) Pij 
iel j>i 
= wl'X(t)w 


where X (t) is the covariance matrix of (MtM; (t),..., MtMn (t)) such that: 


355 (t) = pi jai (t) a; (t) 


It follows that: 


where X ~ N (0,1). We deduce that the portfolio mark-to-market is a Gaussian 
random variable: 
MEM (0) ~ N (12, (0) ,02 (1) 
. We have: 
cov (MtM; (t) , MtM (t)) 
„var (MtM, (t)) var (MtM (t)) 
: os (0 Xi XT we; (0 X] 
ci (t) o (t) 
j an w30; (t) xaX; 


It follows that: 
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where the idiosyncratic risks e; ~ V (0, 1) are independent and satisfy c; L X. We 
verify that IE [X;] = 0 and: 


o? (Xi) = (0c 00 (1-70) a? (e) 
= O+- 
1 


. If we note e*t (t) 2 max (MtM (t) — C (t),0) and C(t) = max (MtM (t) — H, 0), the 


expression of the counterparty exposure is equal to: 


et (t) = max(MtM (t) — max (MtM (t) — H,0),0) 
= MtM(t)-1{0< MtM (t) < H} + H- 1(MtM (f) > H} 
We have: 
MtM(t) 2H e uw(t)-os (the > H 
* H — Hw (t) 
e rz2z'(H)- pz 
and: 
MtM (t) > 0% g> a (0) = BD 
We deduce that: 
a*(H) 
BPE (tw) = f (u (+ou (02)6(2) de + 
x* (0) 


d 
CC x 


We have: 
a* (IH) 
One | (dus (£) + ou (2) 2) 6 (a) de 
x*(0) 
z*(H) z^(H) 
= quw xv) dz + Ow xo (x) da 
u of. ó (x) dz + of $ (x) 
= us (0 (9 (e* (H)) — 9 (2* (0) + ow (8) = m 
=. Ly ( x x Ow vat NT 
— a (€) (8 (* (H)) — 9 (2* (0))) + ow (£) (6 (2* (0) —  (a* (2D) 
and: 


f. 90 dc -1- 9G an) 
a*(H) 


Using the fact that (x) +  (—7) = 1, we finally obtain the following expression: 
eo = sete) (e) 
a0 (o(a) oa t 
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4. C(t) =0 is equivalent to impose H = +oo. Indeed, we verify that: 
C(t) = max(MtM (t) — H,0) 

= max(MtM (t) — oo, 0) 

= 0 


It follows that: 


BPE (tiu) = pw (0) (ED) + on (6 (S23) (44) 
We have: 5 , 
bed ) = pt) 


and: 
dos _ EHW); 
Ow; c (t) 
= y(t) o; (t) 


because we have the following relationship between (3X (t) w); and q; (t): 


(Z(àw) = Yo piso: lt) o (t) w; 


It follows that: 


We deduce that: 


and: 
O9 (HoD) | Bw(t)  ( Bo()N 3 (Hw (t) 
Pu e)" c ee) oue) 
because ¢ (r)' = —xó (x). Therefore, the expression of the marginal risk is equal to: 
E a c 
Hw (t)\ 9 ( bw (t) 
eO) dm (ant) * 
vaate (E52) - 
Hw (t) Hw (t) Ó Hw t) 
0 DEO) dm ED) 
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Finally, the expression of the risk contribution is given by: 


RC, = apet 
siete e o — « 
We have: 
yore, = Yu wo (5e n) DX Or ao (£e a) 
= (e) ones (e) eon 
- (2) cns) 
= EpE(t;w) 
because: 


> w (t)oi(t) = i (E p 2 c (t) 


(0 Mia 222 wiw;pi;0; (t) ei (t) 
E c (t) 

.. € (t) 

|. c(t) 

= a(t) 


We conclude that the risk measure EpE (t; w) satisfies the Euler allocation principle. 


. We can write: 


EpE (t; w) = E; (t; w) — E» (t; w) + Es (t; w) 


where: 
Eji(tw) = pw (ta (5s o) ee aes 3) 
E (tw) = py (t)® (s2 =) EOE)? C =) 
Ben = mo (eE) 
We have: 
ga (ea) = am (Sa) Hom (acu) 
Hw (t) 
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Counterpa 


e that: 


We deduc 


e have: 


It follows that the marginal is equal to: 


O Ey (t; w) 
Ow; 


ô EpE (t; w) 


S 
SS 
de 
D 
+ 
E 
D3 
de 
D 


Ó wj 
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The expression of the risk contribution is given by: 


RC, = wip (t) (è (5 a * Gaon! : 


moao( (E9) (E8) ^ a 
We have: 
Sro = Duo (e (#8) -a (875). 


I 

io 
"a 

Ej 
e 
x 
e 
ZA TUN 


'These risk contributions do not satisfy the Euler allocation principle, meaning that it 
is not possible to allocate the CVA capital according to Equation (4.6). 


6. Type A Euler allocation is given by: 


RC; = E[w;MtM; (t) - 1 (0 € MtM (t) < H)] + 
:[1 (MtM (t) > H)] E [w; MtM; (t) - 1 (MtM (t) > HJ] 
:[MtM (2) -1 (MEM (t) > H} 


H. 


Using Equation (4.2), we have: 


= wip (t) + wisi (t) yi (t) X + wioi (t) y 1—7? (t) ei 


Since e; L X, it follows that: 


- Ele; (ui E < pw (0) tou (0 X < H]] 


\| 
=S 
8. 
5 
= 
= 
Pas 
EE 
S 
n 
& 
lem 
8 
+ 
pom 
»* 
* 
g 
A 
na 
t. 
B3 
Pm. 
SU 
m 
S. 
ames 
8 
— 
e 
3 
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| 

£ 
E 
PERS 
St 
S 
e 
Lm 
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e 
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and: 


We also have: 


a = 


II 


and: 


() = E[L(MM (9 2 HY 


= f $ (x) da 
z*(H) 


- (a) 


:[MtM (t) - 1 (MEM (0) > HY] 
2 [(Hw (t) + Ow (t) X) “1 {Hw (t) + Ow (t) X 2 Hj| 
( 


he Hw (t) + Ow (t) x) 6 (£) dz 
Heus (89-1 + oy (t) ó (eS 


w 


(x) = E[w;MtM;(t)- 1{MtM (t) > HY 
= E [wi (pu (£) + e (0) Xi) 3 (uo (t) + ow (t) X > HY 


(ete ow) 
ee 
where: TE Ga =) + wiqi (t) ei (t) o Ga H 
and?: 


vo c m 0 (MOF) ou ne (0-7) 


7. The type B Euler allocation is given by: 


RC; 


= E[w; MtM, (t)-1{0 € MtM (t) < H} + 
Wi MtM; (t) 
E | MtM (t) 


. 1 {MtM (t) > z 


4We notice that Yw = P» Vi. 
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We have: 
wi MtM; (t) _ Wifi (t) + Wisi (t) ^Y (t) X , Wii (t) 4/1 — y? 25 
MtM (t) lw (t) + Ow (t) X "tty OE OA - 


Since e; L X, it follows that: 


| g | wi MtM, (t) 
Ue | MtM (t) 
- E E (ui (£) + y: (0) ei (t) X) 

Hw (t) + ow (t) X 
; wies (£) v1 n1 (f) 
Hw (t) + Ow (t) X 


. 1 (MEM (f) > m) 


Aes (2 os (2) > 1] + 


A (us (t) + ow exsan| : [ei] 


MENCIOEZTOLAOAM 
Larl Hw (t) + Ow (t) x )« ue 


Finally, we obtain: 


. It follows that: 


MtM(t) = MiwiMtM, (t) 


= fly (t) + ow (t) X 
where X ~ N (0, 1). The correlation between X and Xp is given by: 
cov (MtM (t) , Xp) 
v/var (MtM (t)) var (X p) 


S [Doin Wis (t) X: XB] 
c (t) 


Qw (t) m 


S [Doin wio; (t) oi XB Xp] d i pe S CO MESI nX] 


o (t) c (t) 
Wisi (t) 
= (t) Oi 


i=1 
We deduce that: 
X = ow (t) Xa + V1— o; (t) 0 


where the idiosyncratic risk 7 ~ NV (0,1) is independent from Xp. 


(4.8) 


(4.9) 
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9. Pykhtin and Rosen (2010) notice that all previous computations involve unconditional 

expectations, implying that we can derive easily the expected counterparty exposure 
i [e (£)| and the corresponding risk contributions RC; by replacing all unconditional 
expectations E[Y] where Y is a random variable (MtM; (t), MtM (t) and e(t)) by 
conditional expectations E [Y | T = t| where 7 is the default time of the counterparty. 
Following Redon (2006), this is equivalent to calculate the conditional expectation 
with respect to the random variable Xp: 


jy |T =t] =E[Y | Xp = B (t)] 


where B (t) = $^! (1 — S (t)) is the default barrier and S (t) is the survival function 
of the counterparty. For conditional means, we have: 


pi (t| T = t) = mi (t) + eios, (t) B (t) 


and: 
jus (t | T=t) = Py (t) + Ow (t) Ow (t) B (t) 


For conditional volatilities, it follows that: 


oi (t| T —t) — 4/1— @70; (t) 
Ow (t| T=t) = V1 —- ©, (t)ow (t) 


Since the unconditional correlation q; (t) is equal to cov (X;, X), we have: 


and: 


y(t) = E[X;X] 


= (axe + Vi - am) (os 0 Xe + O)| 
= wlt) - J1— o? V1— es u(t] v —1) 


where 7; (t | T = t) is the correlation between 7; and 7 or the conditional correlation: 


^i (t) = oiov (t) 
vA - eA - ez (t) 
To compute EpE (t; w) = E [e* (t) | T = t| and RC;, we replace ui (t), Hw (t), o; (t), 


Ow (t) and qi (t) by uw; (t| T—t) ww (t| T—t, ei(t| T —0), ev (t| T =t) and 
yi (t | T = t) in Equations (4.3), (4.4) (4.5), (4.7) and (4.8). 


vu (¢| 7 =t) = 
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5.4.1 Estimation of the severity distribution 


1. (a) The density of the Gaussian distribution Y ~ N (u, o°) is: 


g(y) = = zT (5) 


Let X ~ LN (u,o?). We have X = expY. It follows that: 


with y = In z. We deduce that: 


no = sies): 


(b) For m 2 1, the non-centered moment is equal to: 


ee 1 1 /nz-u\’ 
z [X"] = D qu exp ( = £) dx 
0 LOVT 2 o 


By considering the change of variables y = o^! (ln z — p) and z = y — mo, we 
obtain: 


— oo 


s 1 ye 
E [X] E J emetmay e 3" dy 
v 2T 
TP. xq 1,2 
= sem «f ——e 39 moy dy 
Le. MATT 


m lm?g? T —il(y—-moe)y? 
= e" x ea x ——e 2 dy 
Soo W2T 


We deduce that: 
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and: 


var(X) = E[X?| —E? [Xx] 


2 2 
= e? +20 E gato 


= get (e — 1) 


We can estimate the parameters u and o with the generalized method of moments 
by using the following empirical moments: 


iil o) =a, — età 


hia (p, 
2 
hia (t,o) = (s. 2 ede?) _ eduto? Ca E 1) 


(c) The log-likelihood function of the sample {x1,...,@n} is: 
S nf (zQ 
i=1 


n n 2 
" n 2 n 1 Ina; —p 
= 5 Ing 5 ln 27 2x Ina; 5 1 ( - ) 


i=l 


Il 


£(u,o) 


To find the ML estimators fi and 6, we can proceed in two different ways: 


#1 X ~ LN (u.c?) implies that Y = In X ~ N (u, 0°). We know that the ML 
estimators (i and 6 associated to Y are: 


1 TL 


> 
Il 


We deduce that the ML estimators f and 6 associated to the sample 


(21,..., Zn} are: 
=  — n ri 
: Hem 
ô = 1 y^ (nz, - Ay? 
n I 


i=1 
#2 We maximize the log-likelihood function: 
(ji, 6} = arg max £ (11,0) 


The first-order conditions are ô, £(u, 0) = 0 and 0, £(u, c) = 0. We deduce 
that: 


O,£(u,0) = —5 2 (nai = p) = 


and: 


wa 
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We finally obtain: 


and: 


The probability density function is: 


fay = opt Sh 


qr) 


For m > 1, we have: 


Il 
8 
NE 
e 
o— 
8 
8, 
z 
| 
R 
| 
E 
e. 
8 


We deduce that: 


and: 


var (X) = E[X?]- E?’ [X] 


2 
za Q 3 a 
- gea (<4) 


(a-1)(a-2) 


We can then estimate the parameter o by considering the following empirical 
moments: 


hia (o) = Tti 


a a 2 


Se) (a-1)(a-2) - 


hi2 (a) = (s z 


The generalized method of moments can consider either the first moment h; (a), 
the second moment h;,2 (œ) or the joint moments (hi1 (o) , hi,2 (a)). In the first 
case, the estimator is: 

EE 


im oua Vi nr 
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(b) The log-likelihood function is: 


f(a) = din f(a) 


nlno - (a+ 1) 5 nz; + nalnz. 
i=1 


The first-order condition is: 


Os lla) = £ - Mns; Mns. =0 
i=1 i=1 
We deduce that: " 
P3 
n 23 n a 


The ML estimator is then: 
n 


> (nz; lng) 
3. The probability density function of (iii) is: 


OPr{X €x 
f(a) = DP sat 
x 
Beale Be 
T (a) 
It follows that the log-likelihood function is: 


dns (xi) 


= -nlnT (o) + nalng + (a-1)X ` nz: - BY xi 


i=1 i=1 
The first-order conditions ĝa £ (a, 8) = 0 and Og £(o, 8) = 0 imply that: 


n(m- Te) +% ma, =0 


a= 


£(a, B) 


II 


and: r 
ns — 5 z;—0 

i=1 

4. Let Y ~T (a, p) and X = expY. We have: 


fx (x) |dz| = fv (y) |dyl 

where fx and fy are the probability density functions of X and Y. We deduce that: 
Beye tefy 1 

T (a) * a 
B? (In z)^^! e-8Inz 

aT (a) 

pens 
T (a) z&*1 


fx(z) = 
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The support of this probability density function is [0, +00). The log-likelihood function 
associated to the sample of individual losses [z1,...,24] is: 


£(o,8) =) Inf (2%) 


= —nlnTP (o) + nan 8 + (a — 1) X ln (Inz;) — (B4 1) Sona; 
i=1 


i=1 

(a) Using Bayes’ formula, we have: 

Pr{H< xX <a} 
Pr{X > Hj] 

F (x) — F (H) 
1- F(H) 


Pr{X <a|X>H} = 


where F is the cdf of X. We deduce that the conditional probability density 
function is: 


f(e|X>H) = à Pr{X<a2|X>H} 
f (x) 

= ——— xl{r> H 

i-pu ed 
For the log-normal probability distribution, we obtain: 

1 1 1(Ine—n)? 
r| X> H) = x e-3( z^) dr 

fG|X2H) = - Ta e 
= ue x. 


We note Mm (4,0) the conditional moment E[X™ | X > H]. We have: 
oo m-—1 -— 
Mm (uso) = ex | Te G0) ds 


ex [^ C Ha mas 
l 


nH OV2T 


oo z— mo? 
excedere [7 1 LEER 
In H Ov 2r 


1 ® (nance?) 


o 
= € 


1-6 (==) 
The two first moments of X | X > H are then: 
1- g (1e ] 


1-6 (=) 


mu+4m’ o? 


etta? 


Mı (u,0) =E[X |X X 2 H] = 


and: 


1-6 
Ma(pu,o) = VIX? |X > H] = ( id lcs 
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We can therefore estimate and o by considering the following empirical mo- 


ments: 
{ hia (u,0) = vi — Mı (u,0) 2 
hi; (u, 0) = (zi — Mı (H,0))” — (Me (u, 0) — Mi (m, 0)) 
(b) We have: 
f(| X >H) = zu x 1{zx > H} 
(5) 
E | 
= a 


The conditional probability function is then a Pareto distribution with the same 
parameter o but with a new threshold rz. = H. We can then deduce that the 


ML estimator à is: 
n 


Onz) -nl H 


a= 


(c) The conditional probability density function is: 


f(e|X>H) = Deus ribi 


aw )/ © Bopo-le—Bt 
Cae JL 9-9 


Beale Pe 
eo —1e-ft 
Jor epe edt 


We deduce that the log-likelihood function is: 
L(a,B) = nalng-nln (/ pete trat) E 
H 


(a — S dus E» 
i=l i=l 


5.4.2 Estimation of the frequency distribution 


1. We have: 
AB 
— pl — pàr ^Y 
Pr{N=n}=e ^Y F 
We deduce that the expression of the log-likelihood function is: 


T 
£Qy) = X nPr{N =Ny,} 


t=1 


T T 
= -dyT+ (Y s) In Ay — 3 In (Ny,!) 


t=1 t=1 
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The first-order condition is: 
8 £ (Ay) oo 
Y 
ps pe > J 


We deduce that the ML estimator is: 


imd n 

Ày ==> Ny —— 

SE fete 

t=1 
Using the same arguments, we obtain: 
AT N 

N 1 n Y 
ào = — No, = = = — 
9? 4AT£ 7" ap 4 


. Considering a quarterly or annual basis has no impact on the capital charge. Indeed, 
the capital charge is computed with a one-year time horizon. If we use a quarterly 
basis, we have to find the distribution of the annual loss number. In this case, the 
annual loss number is the sum of the four quarterly loss numbers: 


Ny = No, + Ngo» + No, T9 


We know that each quarterly loss number follows a Poisson distribution P (Ac) and 


that they are independent. Because the Poisson distribution is infinitely divisible, we 
obtain: 


No, + No, + No, + Na, ~P (año) 


We deduce that the annual loss number follows a Poisson distribution P (Av) in both 
cases. 


. This result remains valid if we consider the first moment because the MM estimator 
is exactly the ML estimator. 


. Since we have var (P (A)) = A, the MM estimator in the case of annual loss numbers 
is: 
T 
4 1 y 2 n? 
Ay = T a Ny, — T2 


If we use a quarterly basis, we obtain: 


ju E n? 
joo Sp 
AQ 4 T2. t Ta) 


There is no reason that Ay = 4Àg meaning that the capital charge will not be the 
same. 
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5.4.3 Using the method of moments in operational risk models 


1. (a) By definition, we have Pr (N (t) = n} = e7*A"/n!. We deduce that: 


INO = SonxPr{N(t)=n} 


(b) We have: 


2 
c 
| 
=) 
— | 
I 
— 
3 
| 
= 
c 
»- 
z|% 


n=0 


The term of the sum is equal to zero when n = 0,1,...,m. We obtain: 


[Tew - a] - Y (n(n=1) (n-m) 


with n’ = n — (m + 1). It follows that: 


We deduce that: 
var (N(t)) = E[N(Q?| - E [N (5) 


= E[N (£) (N) - 1] + E[N (] - E? [N (t) 
Using the formula (5.1) with m = 1, we finally obtain: 


var(N(t)) = AH A-A? 
= 
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(c) The estimator based on the 


(a) We have: 


first moment is: 


(b) Because (X2 ,2;)) = 2%, a? + 3 ug Tix;, it follows that: 


N(t) 
[S] = E b XP +35 SOX; 
i=0 


N(t) N(t) | 


ij 


= E[N (t)] E [X7] + E[N (t) (N (t) - D]E[X;X;] 
= EIN (o]E[X?] + (E [N (0?] - EIN (9]) E (Xd 
We have: 
E[N()) = A 
N (| = var(N (0) +E?[N(t)] 2 A X2 
and: 
[X] = var(X;) +E? [Xj] 
= erto’ (e — 1) + Cane 
- e2ut2e? 
[X]E[X;] = et ao” our o? 
EN eate? 
We deduce that: 
E[S] = AE[X?] + (A M - A) E[X;] E[X;] 


XE [X2] + XE [X;] E[X;] 


Ne2Ht20? + giao? 
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and: 
var(S) = E[s?] —E?[S] 
2 2 d: 22. 2 
EN eth t 20 NE AeL tTI _ 2 (eti ) 
- Aehit20? 
(c) We have: 


[S] = Aetio? 


var (S) = Ae2t2e* 


We deduce that: ^ 
var(S)  Ae?^*?" e7 


22 [5] EU ~ 


It follows that: 


c? = nA + In (var (S)) — In (E? [S]) 


and: 
; l 2 
u = In [5] -InA -= 50 
; 1 12 3 1 
= In :[S] + 5 In ( [5]) - 5 nA — 5 In (var (5) 


Let À be an estimated value of A. We finally obtain: 


1 ~ 1 
Á — In mg 5 nms In A 5 Invs 


and 


ô = Vin Â+ Invs — nm 


where mg and vg are the empirical mean and variance of aggregated losses. 


ew 
= 

© 
SE 


We know that the duration d between two consecutive losses that are larger than 
L is exponentially distributed with parameter A (1 — F (£)). We deduce that: 


l = 
nehty’) 
Sol 
(b) We can ask experts to estimate the return time d; for several scenarios @; and 


then calibrate the parameters A and a using the method of moments and the 
following moment conditions: 


5.4.4 Calculation of the Basel II required capital 


1. In order to implement the historical value-at-risk, we first calculate the daily stock 


returns: 
P, 


TBa ' 


fü 
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TABLE 5.1: Stock returns R4,, and Rpg,s (24 first historical scenarios) 


Ras 
2.01 0.01 0.73 0.71 1.79 2.27 —0.15  —0.55 
—0.43 1.01 0.05 0.32 2.08 —2.37  —0.55 2.57 
0.29 —2.54 —0.03 0.00 —0.90 —0.03 1.96 —0.35 
RB,s 
0.35 —0.84 0.85 1.40 1.35 1.36 —1.45 —1.95 
2.17 1.51 —0.69 1.87 0.06 1.61 1.25 2.20 
1.07 2.85 0.99 —0.06 —2.34 —1.31 3.79  —1.46 


where P, is the stock price at time t. We report the return values for stocks A and B 
in Table 5.1. These data are used to simulate the future P&L defined as follows: 


II; 


10000 x 105.5 x Rais + 


25 000 x 353.0 x RB,s 


where R4,, and Rg,, are the stock returns of A and B for the s* historical scenario. 
Table 5.2 gives the values taken by Ils. We then calculate the order statistics 115.550 
and deduce that the value-at-risk is equal to: 


VaR os, (w) 


$318 883 


It follows that the required capital is equal to: 


KMR 


$3.53 mn 


1 
5 (323072 + 314 695) 


(3 + €) x V10 x VaRoo% (w) 


TABLE 5.2: Daily P&L (24 first historical scenarios) 


Daily P&L II, 


9972 —74339 67 520 115 824 137 790 144 032 
—129857 —178 339 186 837 143722  —60 767 168 780 
16234 —166679 —116117 221553 —91336 —278 402 
—87 357 —5671 —215517 —116 172 354813 —132 741 
Order statistic 115.250 
—340656 —323072 —314695 -—278402 -—277913 -—275118 
—268632 —259 781 —255936 -—252509 -—250117 —249523 
—243502 -—218295 -217514 -217327 -215517 —211 382 
—211018 -—208061 -—192950 -—192603 -—190993 -—189410 


2. We apply the IRB formulas with the right asset class exposure. For the bank meta- 


credit, we have: 


p (PD) 


1296 x 
19.2896 


1 — e-50x176 


1 


J= 
rao + 24% x 


(1 _ ee) 
1 — e750 
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Because the maturity of the meta-credit is one year, the maturity adjustment is equal 
to 1. We deduce that: 


"T ®-1 (1%) + 0.19280! (99.9%) 
K* = 7% x ® ( Jt 0.1908 


) 75% x 1% 
= 9.77% 


It follows that: 
RW = 12.5 x 9.77% = 122.1396 


and: 
RWA = 80 x 122.1396 = $97.70 mn 


We finally obtain: 
K = 8% x 97.70 = $7.82 mn 


For the corporate meta-credit, we proceed in the same way, except that we have to 
incorporate the maturity adjustment. We have: 


b (PD) = (0.11852 — 0.05478 x In (596))? = 7.99% 


and: 
1+ (2 — 2.5) x 0.0799 


e (M) = — 75 x 0.0799 


Using the IRB formula, we obtain K* = 15.35% and K = 30.69%. For the SME meta- 
credit, we have to be careful when we calculate the correlation. Indeed, we have!: 


— 1.0908 


z 1 — e7 50x2% 1 — (1 — e750x2% 
pone (PD) = 12%.~x exer + 24% x C EE ) 
30,5)—5 


For mortgage and retail exposures, we use a one-year maturity. The default correlation 
is set equal to 1596 for the mortgage meta-credit whereas we consider the following 
formula for the retail meta-credit: 


MES e 395x476 T= (1 = prose) 
3% X —— 39-1676 x PE 


E= 
6.21% 


I 


p(PD) 


II 


All the results are reported in Table 5.3. At the bank level, we then obtain: 


RWA = 97.70 + 383.65 + 55.95 + 97.87 + 122.80 
$757.98 
and: 
Ker = 7.82 + 30.69 + 4.48 + 7.83 + 9.82 


$60.64 


lIn order to simplify the calculation, we assume that the USD/EUR exchange rate is equal to 1. 
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TABLE 5.3: Calculation of capital requirements for credit exposures 


Exposure b(PD) (M) p(PD) k* RW RWA K 

Bank 0.14 1.00 19.28% 9.77% 122.1396 97.70 7.82 
Corporate 0.08 1.09 12.99% 15.35% 191.83% 383.65 30.69 
SME 0.11 1.46 14.64% 8.95% 111.91% 55.95 4.48 
Mortgage 15.00% 15.66% 195.74% 97.87 7.83 
Retail 6.21% 9.82% 122.80% 122.80 9.82 


3. We calculate the capital charge for operational risk by Monte Carlo methods. The loss 
is equal to: 


where L; ~ LN (8,4) and N can take two values (N = 5 or N = 10) with: 


Pr{N=5} = 60% 
Pr{N=10} = 40% 


We first simulate the yearly number of operational losses N by inverting the cumulative 
density function: 


Pr{N <5} = 60% 
Pr(N <10} = 100% 


Let u, be a uniform random variate for the sí^ simulation. The simulated variate N, 


is defined as follows: 
N.= 5 if us <0.6 
5 | 10 otherwise 


Then, we have to simulate the operational losses Le using the probability integral 


transform: 


U 


l 
E 
B 


It follows that: 
L; = exp (8 + 2 x 6^! (U)) 


Let ul?) be a uniform random variate. We have: 
Le = exp (s 42x01 (ut?) 
Another way to simulate Le is to notice that 9^! (U) ~ (0,1), meaning that: 
Le = exp (s +2x nf?) 


where n(? is a normal random variate M (0, 1). The simulated value of the aggregated 


loss is then: 
Ns 
b= Dn 
i=1 
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Let us consider an example. We assume that u, = 0.2837. It follows that N, = 5. 
This means that we have to simulate five operational losses in the year. We obtain 
the following figures: 


i 1 2 3 4 5 
ul?) 0.4351 0.0387 0.2209 0.3594 0.5902 
$-! (ut?) —0.1633 —1.7666 —0.7692 —0.3600 0.2282 
Le 2150.26 87.09 640.05 1451.02 4704.84 


The first loss experienced by the bank is $2 150.26, the second loss is equal to $87.09, 
etc. We deduce that the yearly total loss is equal to $9 033.25: 


L, = 2150.26 + 87.09 + 640.05 + 1451.02 + 4704.84 
$9 033.25 


By considering ng simulated values of Ls, the capital charge for operational risk is 
given by the 99.9% quantile: 


VaR 99.99, = Lo.999n3:ns 


For instance, if we consider one-million simulation runs, the capital charge corresponds 
to the 999 000" order statistic. In our case, we estimate the capital charge with 250 
millions of simulation runs and obtain: 


Kor = $4.39 mn 


Because the required capital is estimated using Monte Carlo methods, there is an 
uncertainty on this number. For instance, we have reported the histogram of the VaR. 
estimator with one-million simulation runs in Figure 5.1. In this case, we obtain: 


4. We deduce that the capital ratio of the bank is equal to: 


CBank 
RWA +12.5 x Kmr + 12.5 x Kon 
70 
757.98 + 12.5 x 3.53 + 12.5 x 4.39 
8.17% 


Cooke ratio = 


5.4.5 Parametric estimation of the loss severity distribution 


1. We consider that the losses follow a log-logistic distribution. 


(a) By definition, the probability density function is equal to: 


OF (2; a, B) 


f (z;a, 6) = Da 
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Frequency (in Z) 
> 


4.20 425 430 435 440 445 450 4.55 
VaR (in $ mn) 


FIGURE 5.1: Histogram of the VaR9g 9s, estimator with ng = 10° 


We deduce that: 
(8/0) (2/a)f™ (1+ (2/a)") 
(1+ af] 
(z/)" (b/a) (w/a)? 
(1+ e/a) 
(b/a) (z/a)" ^ 
(1+ G0) 


f(x;a,8) = 


(b) The definition of the log-likelihood function is: 


£(a,B) = Yin 
We deduce that: i 
£(a,B) = nln(f/a)- (8 —1) Silk (x;/a) — 257m (1 + (zifa)) 
i=1 i=1 
= nln8 —nfglno-(8-— iSc ds — 
i=1 


257 In (1 + (zifa)) 


i=1 
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(c) Maximizing the log-likelihood function leads the first-order conditions: 


O£(a, (zi a)? 
(89 _ 8 at " 


and: 


O£(a,B) n (zi/o)? Cor 
GP =F -nmatY ma; 23 Ec =0 


By assuming that 8 Z 0, we deduce that: 


SOF (zio. 8) 73 
i=1 


and: 


5 -nimat rna apt - T7 2n DE (vifa) g-0 


1+ (z;/a)? 


We then obtain: 


E Ina; —2 F (z;;o, 6) naz; = 0 
3 x 2 ) 
or equivalently: 


1 B 


It follows that the ML estimators @ and B satisfy the following conditions: 


boue F (z; â, 8) =n/2 
4 (2F (zi. 8) = 1) ln z; = n/B 


(d) Using the sample of loss data, we obtain: 


10 
22 F (ss a 8) — 10.000 


M: 


a 


and: 
10 
By” (2F (z; â, 8) = 1) In z; = 9.999 
i=1 
Because the two mathematical terms are equal to n = 10, the first-order condi- 


tions of the ML optimization program are satisfied. 
(e) We have: 


(0,8) = Sof (a;a,8)— >In (1— F (H; a, p)) 
i=l i=l 
= e E E 


23 In (1 + (0:/a)®) + nn (1+ (H/a)*) 


i=l 
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5.4.6 Mixed Poisson processes 


1. We recall that E [P (A)] = var (P (A)) = A. We deduce that: 


INO] = ElE[N (5 | All 
= EA] (5.2) 
and: 
var(N(t)) = E[N] -E?(N (5) 
= E[E[N (0? | A|| 2E? EIN | A] 
= E [var (N (0) | A) +E? [N (£) | Al] - E? [E [N (5) | All 
= E[var(N (t) | A] + E [E? [N (t) | AJ] — E? [E [N (t) | Al] 
= var (N (t) | A)] + var (IE [N (t) | A]) 
= A] + var (A) 
= N (t)] + var (A) (5.3) 


2. By definition, we have var (A) > 0, which implies that: 


var (N (t)) > E[N (#) 


The equality holds if and only if var (A) = 0. We deduce that A must be constant and 
we obtain the Dirac distribution: 


Pr{A=A}=1 
Since we have N (t) ~ P (A), we deduce that: 


p(n) = Pr{N(t)=n} 
e ^A? 


n! 


It follows that: 
(n -- 1): p(n4- 1) 


y(n) = 


p (n) 

u eg ^Xer n 

lcs (n+ 1)! e 

= À 

3. (a) We reiterate that: 
BAs 
and: A 
var (G (a, B)) = gi 
Using Equations (5.2) and (5.3), we obtain: 
[N (0) = E[A] 
= E[G(a,8)] 


(5.4) 


z 
B 
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and: 


var (N (t)) = E[A] + var (A) 
= E[G (a, b)] + var (G (a, B)) 


(b) By definition of the compound distribution, we have: 


Il 


pii [re Py 101 G(@,8)) a 


7 5 e ^A fe Ae-le-8^ 
Em ^ n! T (a) 


p? T —A(8+1) \n+a—1 
= ne? _s n+a d 
nT (a) Jy * i a 


We know that: E 
ji tle t dt =T (a) 


0 
We deduce that?: 


if tote dt = n tele dt 
0 0 
l (e 2g dx 
E e? 
0 b b 
= li vt le^? dx 
0 


I 


_ F@ 
b 
From Equation (5.6), we obtain: 
~% Tinta 
pn = ree 
|. T(n+a) Be 
nI (a) (Gan 
We notice that: 
Tinta) — (nt+ta-1)! 
nT (a) ^ n!(a- 1) 


I 


pum 


ays c GG 
(G+ B+1) \B+1 
f 1 

+ 


and: 


2We use the change of variable x = bt. 


(5.5) 


(5.7) 
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Therefore, Equation (5.7) becomes: 
n+a-1 1 Š 1 E 
iiim ( n )o oa) Ga 
n+r—-1 
— T T gn 
( 4 ) p) p 


This is the probability mass function of the negative binomial distribution 
NB (r,p) where? r = a and p = 1/ (B + 1). 


(c) We have: 
2 (n+1)-p(n+1) 
= (n+l) 2) (tap p 
| pu (1— p)' p^ 
- (n — r)! 
orc 
= pnt+opr 


4. (a) Since we have EJE (A)] = A^! and var (E (A)) = \~?, we obtain: 


1 
N (t) == 
MO 
and: 
1 1 
var (N (t = -+5 
(N) = r3 
NECS 
= Xi 
3 An alternative approach to find the values of r and p consists in match the first two moments. Indeed, 
we know that: 2 
E [NB (r.p) = 
1-p 
and: pr 


var (NB (r, p)) = TF 


We deduce that: 


a pr Qo. pr 

B  1— B T= 

am pr M BEI. 
g (1— p)? B  1=p 


t 
R eee 
WD ® 
| 
= d 
|| | 9 
= 
MESS 
"3 
| 
= 
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(b) We have € (A) = € (1, A). We deduce that the compound Poisson distribution is 


the negative binomial distribution MB (r,p) where r = 1 and p = 1/ (A 4- 1). In 
this case, the expression of the probability mass function becomes: 


p(n) = ("1 a-a 
= (1-pp 


We conclude that N (t) has a geometric distribution G (1/ (A + 1)). 
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7.4.1 Constant amortization of a loan 
1. We have: 
2 A 


æ = M 


— 0i) 


1 


A V 
i an YG 
ie 


t 


1 
+i) 


1 
ns 
- < 088 


(14 i) 1- T 


= (mr)? 


= C(n) A 


where Cin) is the capitalization factor: 


1—(14+%)” 


1 


(at) fen 


We deduce that the value of the constant annuity is equal to: 


= (rais) 


7 (14 i)" iN. 
c MESE 
i 


= — Mo 
1- (14 i) 


C(n) = 


2. Since Co = No, we have: 


It follows that the constant annuity rate a(n) is given by the following formula: 


i 1 


a(n) = \—n 
1— (14 i) C(n) 


3. At time t = 1, we pay A. The interest payment is equal to 7 (1) = iNo while the 
principal payment is equal to the difference between the annuity and the interest 


119 


120 Handbook of Financial Risk Management 
payment: 
P(1 = A-I(1) 
= (am =i) No 
i 
RUE 
1— (1-4 1i) 
We deduce that the amount outstanding (or remaining capital) is equal to: 


N (1) No — P 


I 


We also have: 


Since we have: 


dp 0+) ud (£ +i) (1+ 277) 
Ew eee (+i) (1+i)"-1 
7 1 (opa Sea? 
ER CT x | (1 i)" 1 
7 1 
(+i) 


we conclude that the amount outstanding N (1) is equal to the present value of the 
annuity at time t = 1: 


4. More generally, we have: 
N(t) = C(t) 
= C(n—1t)À 


7 ( ze S) x 
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It follows that: 


I(t) = iN(t-1) 
1 
- (aes) 
and: 
P(t) = A-I(t) 
1 
e Gg 


7.4.2 Computation of the amortization functions S (t, u) and S* (t, u) 
1. By definition, we have: 
S(t,u) = 1{t<u<t+m} 
" { 1 ifwe [t,t+ m[ 


0 otherwise 


This means that the survival function is equal to one when u is between the current 
date t and the maturity date T = t +m. When u reaches T, the outstanding amount 
is repaid, implying that S (t, T) is equal to zero. It follows that: 


* ( ) —oo 
S 1 u = 
S 


fÉ NP (s) L{s<u<s+m} ds 
[NP (s) 1(s € t € s-- m) ds 


For the numerator, we have: 


l{s<u<stm}=1 > u<s+m 
= s»u-—mn 


and: 
t 


1 NP() Ms Sucsem) ds f NP (s) ds 


— oo um 


For the denominator, we have: 


I{s<t<st+m}=1 > t<s+m 
= s>t-m 


and: : ] 
J NP(s)-{s<t<s+m}ds= f NP (s) ds 
t-m 


— oo 


We deduce that: 


P E NP (s) ds 
fi NP (s) ds 


t-m 


S*(t,u) =1{t<u<t+m}- 
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In the case where the new production is a constant, we have NP (s) = c and: 


S*(t,u) = L{t<u<t+m}. 


= I{t<u<t+m}.- 


= Ift<su<t m}-( 


—t 
= I{t<u<t+m}- (i-? ) 
m 
The survival function S* (t, u) corresponds to the case of a linear amortization. 


2. If the amortization is linear, we have: 


S(t,u) =1{t<u<t+m}- (i- 2-5) 


ns (s) (: = —*) ds 
[xo (1- —) ds 


In the case where the new production is a constant, we obtain: 


t P 
J (1 ncm J ds 
um m 


We deduce that: 


S*(t,u)=1{t<u<t+m}- 


S*(t,u) =1{t<u<t+m}- 


For the numerator, we have: 


t — 
/ (1 "m Jj ds — 
um m 


2 t tu 4 t? m u 
B m 2m d 2 2m 


m? +u? + t2 + 2mt — 2mu — 2tu 
2m 


(m — u t)? 
2m 


For the denominator, we use the previous result and we set u — t: 


t = te 2 
D (i-* z) P (m—t+t) 
I—m m 2m 


m 
2 
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We deduce that: 


(m — u 4- t)? 
S*(t,u) = 1{t<u<t+m}- zm 
2 
(m — u 4- t)? 
= Dis <tm}+ i a 
u—t A 
= l{t<u<t+m} |1- 
m 


The survival function S* (t, u) corresponds to the case of a parabolic amortization. 
3. If the amortization is exponential, we have: 
S (t, u) E he Ads = e ^u-0 


It follows that: F 
J.. NP (s) e Mu-9) ds 


S* (t,u) = 75 
Jo NP(sg)e-M-9 ds 


In the case where the new production is a constant, we obtain: 


lacs e7 Alu- s) ds 
J e-^t-3) ds 
[X-1g-3-9)] 


—0o 
on t 


[Are 0-9] 
eTA ut) 


= S(t,u) 


S*(t,u) = 


— oo 


'The stock amortization function is equal to the flow amortization function. 


4. We recall that the liquidity duration is equal to: 
D (t) =f (u — t) f (t,u) du 
t 


where f (t, u) is the density function associated to the survival function S (t, u). For 
the stock, we have: 


D* (t) m (u — t) f* (t,u) du 
t 
where f* (t, u) is the density function associated to the survival function S* (t, u): 


_ JS NP (s) f (s,u) ds 
i, NP (3)S (s,t) ds 


f* (tu) 


In the case where the new production is constant, we obtain: 


SE (u — t) tee f (s,u) dsdu 
foes S(s,t) ds 


D (t) = 
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Since we have ex f (s,u) ds = S (t, u), we deduce that: 


JE (u—t)S(t,u)du 
qs S (s,t) ds 


D* (t) = 


5. (a) In the case of the bullet repayment debt, we have: 


and: 


D* (t) = t 


ME 


(b) In the case of the linear amortization, we have: 


f(u) - Mt Suctem) = 


and: 


S 
Í 
Sls s/o 
3 
en 
g 


For the stock duration, we deduce that 


D*(t) = 
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The numerator and denominator are equal to: 


u? u3 tu? gal" 
(*) = tu | 
2 3m m m |, 
1 2 3 2 2, ttm 
= aem [3mu 6mtu — 2u? + 6tu 6t u], 
1 3 2 3), 1 2) 543 
= gu" -3mP — 248) + — (amt? 26) 
Vie 
c. (ule 
and: 
t 8g] 
e = b-£-zx] 
m 2m], om 
1 t 
= 2,5; 8. 28 t- m], in 
1 
= 5 (? 2t (t — m) — (t-m)? +2(t m?) 
m 
1 
= 5,; (P - 2€ + 2mt +t? -2mt + m?) 
m 
"E 
EN 


and!: 


I 
oo 
8 
S 
> 
2 
g 
a 
[S 


For the stock duration, we deduce that: 


a = Je (u—t)er%—9 du 
f e-^t-3) dg 

i ve?" do 

do^ e dv 

1 


A 
We verify that D (t) = D* (t) since we have demonstrated that S* (t, u) = S (t, u). 


1 We use the change of variable v = u — t. 
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6. (a) By definition, we have: 
dN (t) = (NP (t) — NP (t — m)) dt 


(b) We have: 
1l{s<t<s+m} 
m 


f (st) = 


It follows that: 


f NP (s) f (s,t)ds = A l{s<t<s+m}-NP(s) ds 


E I 
E zl NP (s) ds 
M Jt—m 
We deduce that: 
1 t 
dN (t) = (xp (t) — zl NP (s) as) dt 
m t-m 


(c) We have: 
f (s.t) = Ae 9079 
and: 


f NP (s) f (s,t)ds = ri NP (s) 4e 79 ds 


— oo — oo 


t 
af NP (s) e-*-9 ds 
— AN(t) 


We deduce that: 
dN (t) = (NP (t) — AN (t)) dt 


7.4.3 Continuous-time analysis of the constant amortization mortgage 
(CAM) 


1. We have dN (t) = —P (t) dt where A (t) = I (t) 4- P (t) and I (t) = iN (t). We deduce 
that: 
dN (t) = (iN (t) — A) dt 


We know that the solution has the following form?: 


N (t) = Ce* + de 
i 


where C is a constant. Since N (0) 2 No, we have: 


A 
C-No—-— 
1 


?'The solution of y’ (t) = ay (t) + b is equal to: 


y (t) = Ce% — 
a 
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and: 
AN ;, A 
N(t)= (m - 3 dite 


a 


At the maturity m, we have N (m) = 0, implying that: 


AN ; A A Noe n 
(m- ien $-o do P ete UE 
(3 D 


We deduce that: 


IN : 
N() = Lem): (0 Len DA 
1 — e-i(m-t) 


1-—e-im 


= 1{t<m}-No 


because N (t) = 0 when t > m. 


N 


. More generally, we have: 


is e i(t+m—u) 


N(t,u) =1{t<u<t+m}-N(t) 7 
—e 


'This implies that: 


S(t,u) =1{t<u<t+m}- 


and: 
f : 
NP i= —i(s--m—u) d 
S* (t, u) = Jum (s) ( & : ) d 
Si-m NP (s) (1 — e-*(t-9) ds 


If we assume that NP (s) is constant, we have: 


t 
[s+ sett] 
l 
S*(t,u) = i E 
s+ M] 
l 


t—m 
e K(tm-u) —1 
t+m—-ut+ 


i(t-- m— u) +e ttm) _ 1 
im 4 em —] 


3. We have: 
ie (tt m-u) 


1- e-im 


f(t,u) =1{t<u<t+m} 
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It follows that: 


S 
A 
SE 

Il 

T 
eJ 
Y 
3 
AEDA, 

Q 

| 
= 
c. 

€ 

D 

(em 

Q 


because we have: 


S 
3 
e 
IN 
z 
Q 
e 
ll 
rs | 
[sd 
e. SN 
E 
LL] 
2 3 
om 
3 
œ 
~ >. 
S 
aQ 
e 


iy qm iy qm 
ES B 
= [8 
V io V d0 


7.4.4 Valuation of non-maturity deposits 


1. The current market value of liabilities is the expected discounted value of future cash 
flows, which are made up of interest payments i (t) D (t) and deposit inflows 0D (t): 


Io =E | | » e" (4 (t) D (t) — &,D (t)) a (7.1) 


0 


2. Since we have: 


and: 


oo 


f «ano dt = |» 4 forme dt 


0 0 


Il 


-D+ | e^" Otr (t) D (t) dt 
0 


we deduce that: 


Lo = | f ^ eO (i) D(t) — 3D (£)) ar 


0 


= Dot | | Cen (4 (£) — r (£)) D (t) a (7.2) 
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3. The current value of deposit accounts is the difference between the current value of 
deposits Dg and the current value of liabilities Lo: 


Vo = Do- Lo 
= M. (ri) DO at 


- E | "i ^ etg (t) D (t) a (7.3) 


where m(t) = r(t) — i(t) is the margin of the bank. This is the equation obtained 
by Jarrow and Van Deventer (1998), who notice that Vo is “the net present value of 
an exotic interest rate swap paying floating at i (t) and receiving floating at r (t) ona 
random principal of D (t)”. 


4. If the margin m (t) is constant, we obtain: 


I 


Vo E | | eO (t) D (t) a 


= mgl |j ererar D% 
0 


= mori Do (7.4) 


where ræ can be interpreted as the average market rate?: 


1 
c ELI Ora] 


5. The variation of i(t) is equal to a constant o plus a linear correction term 
B (r (t) — i (0): 

di (t) 

dt 


= a-f(r(t)—i(t)) 

= Br(t)d (o — Bi(t)) 

It follows that i(t) is an increasing function of r (t). Moreover, i (f) decreases (resp. 
increases) if a — Bi(t) < 0 (resp. a — Bi (t) > 0). This implies that i(t) is a mean- 
reverting process, where the steady state is iœ» = /-lo. The variation of D (t) is 
explained by two components: 


dD (t) 


Cı (t) C2(t) 


In the case where r (t) is constant, we notice that: 


E |/ ena 
0 


Il 
o— 
8 
(c 
l 
x 
Ri 
e. 
cM 


Il 
a 
7 
B 3 
ES 
LI 
E 8 


This justifies that roo is an average interest rate. 


130 


Handbook of Financial Risk Management 


The first component Ci (t) is the traditional mean-reverting adjustment between the 
deposit D (t) and its long-term value Da, whereas the second component C; (t) is the 
negative impact of the excess of the market rate over the savings rate. It follows that: 


i(t) = etio + Zi ere? (r (s) + =) ds (7.5) 


and: 


D (t) = e Do + (1 — et) Da - sf e Vt-8) (r (s) —i(s)) ds (7.6) 


. In the case where r (t) is constant and equal to ro, we obtain: 


t 
ilt) = ePi (a+ Bro) f e P(t-9) ds 
0 


-B(t-5) 1f 
— g- Pts (a Bro) E 3 i 


= e Prin + (1 — e Pt) (v t 2) 


= ido4 (1-e*) (r i 7 - io) (7.7) 


It follows that: 


"6-9 = m= (Moe 0- (45 )) 


= e (n — to) — 5 (1-e*) (7.8) 
and: 
t 
D(t) = Doo +e (Do - Deo) = ô (ro — io) f e (t7 ebs ds + 
0 
t 
93 ec? Le) ds T 
B Jo 


Since we have: 


and: 


t t—s t—s)—Bs t 
n e (7-3) (1 = e 8s) ages E x ) e ?( ) | 
0 ^Y y= B 0 


we deduce that: 


D(t) = Do te 7% (Do — Dæ) 


ad (+ =e  ¢ Bt —) 
B a qeu 
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7. If a is equal to zero and we combine Equations (7.8) and (7.9), we obtain: 


eT (r (t)—i(t)) D(t) = etet (ry — io) Doo + 
e Cot (ro — io) (Dg — Doo) — 


6 (e7 (ro28)t — e7 (roc 8 )t 
( (ro io)? 
yap 
It follows that: 
Vo => j e (rot Bt (ro EE io) Das dt + 
0 


J e otte) (ro — io) (Do — Dæ) dt — 
0 


99 $e-(rot28)t 
| (ro io)” dt + 


0 Ae 
oo e- (ror B)t 2 
r i dt 
] = o-i 
We also have: 
= (ro = io) E 
Vo = H 
= (ro ag p) 0 
— (ro — io) (Do — el li 
EE (ro pe eese B) 0 


de- (0 *25 (ro — ig)? sil i de (T0474 P)E (ro — io)? ü 
— (ro 28) (y - B) |, (ro - y+ 8) Y — 8) |, 


'Therefore, the net asset value is equal to: 
y, = (0-10) De. , (ro — io) (Do = Do) 
Í (ro +8) (ro +7 + B) 
5 (ro — io)” 5 (ro — io)" 
7.1 
Cott-  (28)0 -8) D» 
We deduce that the sensitivity of Vo with respect to ro is equal to: 
3 Vo = (io + b) Doo | (Do — Dæ) (io Y +8) j 
aro (ro+ BY (rot Y+ BY —— 
ô (ro — io) (ro + io + 2 (7 + 8)) 
(ro + Y - 8)! (v — 8) 
ô (ro -— io) (ro-t Fio + AB) (7.11) 
(ro + 28) (v — B) 


8. From Equation (7.3), we deduce that: 


2 = |- J 7 tent (r (t) —i(t)) D (t) a + 
n rat? CO p) ar y 
|f e aaar eo ar! (7.12) 
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De Jong and Wielhouwer (2003) observe that the sensitivity of the net asset value is 
the sum of three components: the interest rate sensitivity of the expected discount 
margins, the margin sensitivity with respect to the market rate, and the impact of 
r (t) on the deposit balance D (t). Since we have: 


8(r(t)—i(0) _ -e 


=e 
Oro 


and: 


we deduce that: 


Te =~ lot) te DE ars 
Oro 0 
ceo ditm —pt eo 
d J ee =] D (t) dt+ | EO pq dt = 
0 B 0 
Ò (ro — io) l — (ro--28)t —(rotB+y)t 
—— n e T —£6€ i dt + 
y—B 0 ( ) 
ad S? f | 1 m —) 
(ro*8) | e-(rottY[| 7 7 \dt 7.13 
e e : 
*Y — B Jo ( ) ( B 


'This sensitivity can be computed analytically, but it is a complex formula with many 
terms. This is why it is better to calculate it using the Gauss-Legendre numerical 
integration method. The duration of deposits is then defined as: 


1 OW 
pss 
x Vo Oro 


In Figure 7.1, we have represented the deposit rate i(t) with respect to the time t. 
We notice that: 


h 4 " P — Bt a 2 
Ami = dim io (1 e”) (rot $ io) 
Q 
= na 


Since the margin is equal to r (t) — i (t), it is natural to assume that a < 0 in order 
to verify the condition? i(t) « r (t). The dynamics of D (t) is given in Figure 7.2. It 
depends on the relative position between Do and Dæ. Another important parameter 
is the mean-reverting coefficient y. In Figure 7.3, we have represented the mark-to- 
market Vo, its sensitivity with respect to rg and the corresponding duration. We notice 
that the normal case where ig < ro corresponds to a negative duration, because the 
sensitivity is positive. We explain this result because a = 0 is not realistic, meaning 
that the margin is equal to zero on average. If we assume that « is negative or the 
margin is positive, we obtain a positive duration (see Figure 7.4). In particular, we 
verify that the duration of deposits is higher when market rates are low. 


^'This is the arbitrage condition found by Jarrow and van Deventer (1998). 


i(t) (in Z) 


Deposits 
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FIGURE 7.1: Dynamics of the deposit rate i (t) 


t (in years) 


FIGURE 7.2: Dynamics of the deposit balance D (t) 
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FIGURE 7.3: Duration of deposits when a is equal to zero 
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FIGURE 7.4: Duration of deposits when the margin is positive 
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7.4.5 Impact of prepayment on the amortization scheme of the CAM 
1. We deduce that the dynamics of N (t) is equal to: 


—je-i(m-t) 


dN(t) = 1(t«m]- No— ——— dt 


l]—e-m 
= e-im 


; 1 
= iem) (1 {t< m}. No) dt 


;o—i(m—t) 
Se tN id 


B 1 — e-i(m-t) 


2. The prepayment rate has a negative impact on dN (t) because it reduces the out- 
standing amount N (t): 


; o i(m—t) 


c e ~ E 
3. It follows that: 
: je-(m—t) 
dinN (t) = — (; mo + 0) dt 
and: 
t —4ie- im s) t 
InN (t) -lInN(0) = | ds Ap (s) ds 
i 1 — e-i(m-3s) Ü 
t t 
= [Ihn(1—-e-*-9» | — | à(s) ds 
( at 
ie e~t(m—t) t 
and: 
" j= —i(m-t) _ ft sYds 
RQ = (wo) ohio 
— € 


= N(tS,(t) 
where S, (t) is the survival function associated to the hazard rate Ap (t). 
4. We have: 


E 1—e? = 
Ñ (tu) 21(tX u « t4 m)- N (t) =E eu) 


this implies that: 


€ Ap(u-t) —e imd (i—Ap5)(u—t) 


S(t,u) =1{t<u<t+m}.- 


1-—e-im 
and: 


e ^»(u-t) + (i = Àp) e—im+(i-Ap)(u—t) 


1— e-im 


f(t,u)=1{t<u<t4 dp 
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D(t = 


because we have: 
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t+m 
Ap i (u — t) e?» 79 du + 
t 


1-e-im 


SEE —im t+m ! 
(i — Ap) e / (u — t) "9 du 
t 


J= e-im 


item = 
Xp c= e ^em — J 
1—e-im \ =r, A 
(i — Ag)e melt—Av)m ^ eli-Ap)m —] 
1—e-im | (i — Ay) (i= àp) ) 


1 e im _ e ^m " 1— e ^m 
1—e-im i — Àp Àp 


Àp / ve ^»" dv + (i aH Àp) e j velirAp)v dv 
0 0 
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Part II 


Mathematical and Statistical 
Tools 


Chapter 9 


Model Risk of Exotic Derivatives 


9.4.1 Option pricing and martingale measure 


1. Since we have: 


[9v 


we deduce that: 


w= LO 9080 
It follows that: 
dV (t) = (t) dS(t) + p(t) dB (t) 
= o(t) dS (t) - r(V (t) — 6(t) S (t)) dt 
= rV (t) dt 4- d(t) (aS (t) — rS (t) dt) 
We have: 
d$(t) = —re~"'S (t) dt +e" dS (t) 


= e "(dS (t) —rS (t) dt) 
It follows that: 


dV (t) = rV (t) dt+¢(t) (dS (t) — rS (t) dt) 


= rV (t) dt+e™¢d(t) dS (t) 


Finally, we deduce that: 
dV (t) = -—re '"'V (t) dt +e™™ aV (t) 
= -re V(t) dt - e^"* (rV (t) dt + e™*¢ (t) dS (t)) 
= é(t)d$(t) 


. Under the probability measure Q, we remind that: 


dS (t) = rS (t) dt + oS (t) aW? (t) 


'Then, we have: 


Il 


d$ (t) e™™ (dS (t) — rS (t) dt) 

= e "oS (t) aW? (t) 

= oS(t) dqW? (t) 
We conclude that $(t) is a martingale. Since dV (t) = ¢ (t) d8 (t), V (t) is also a 
martingale. We deduce that: 


V (t) =E? [V (| FI] 


141 


142 Handbook of Financial Risk Management 


and: 
V(t) = eE? eV GL 
oP OES V (I Fa 
4. We have: 
dS (t) = uS (t) dt + oS (t) aW (t) 
and: 
d$(t) = e" (dS (t) —rS(t) dt) 
(u — r) Š (t) dt +08 (t) dW (t) 
We set: 


Using Girsanov's theorem, we know that WO (t) is a Brownian motion under the 
probability measure Q defined by: 


dQ 


= = MO 


= oo Sf (22) a fe) are) 


Moreover, we know that M (t) is an F;-martingale. 


5. We have: 


and: 


We deduce that: 


V(0) = e"'TEP[n(S(T) > K} 
e Pr{S (T) > K} 


et Prf We (T) > = (mK In So ¢ 507) T) } 
= = E (mx In So (r 507) T) 


Therefore, the price of the binary option is: 


V (0) = eT (= (1 ul 4 T) = jT) 
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9.4.2 The Vasicek model 
1. We have: 


1 49?B(t,r) 
por BD) L0) Ao) 


and: 


OB(t,r), 9 B(t,r) 
Or Ot 


r (t) B(t,r) 20 


B(T,r(T)) =1 


2. We remind that the solution of the Ornstein-Uhlenbeck process is: 


t 
r (t) 2 roe ** cb(1—e **) +o J e 6-9 dW (s) 


0 


It follows that: 


Z 


II 


T t 
/ (room +b(1—e7%*) + o f e26-0 qw 9) dt 
0 0 
eat T eat T T t 
ro |- | +b f + | + o f | e* 6-9 dW (s) dt 
a |, a jo o Jo 
1—e74? TE 
bT + (ro »( ) o f J e 6-9 dW (s) dt 
a 0 JO 
We note I = je Js e* 5-0 dW (s) dt. Using Fubini's theorem for stochastic integrals, 
we have: 
T pT 
I= n / etts) ds dW (t) 
o Jt 


T = = 

i= a(T-t) 

= i Lf awit) 
0 a 


Il 


I 


Since J is a sum of independent Gaussian random variables, it follows that Z is also 
a Gaussian random variable. 


3. We have: 
1—e-«T 
z [Z] = bT + (ro »( ) 
a 
and: 
T qa e-«(T-t) a 
var (Z) = E o f — — — —— dW (t) 
0 a 
2 pT 
= zl (1 pem dt 
a” Jo 
2 jT 
EN 5] (1 de-UT-t) 4 q-2a(T ) dt 
a2 0 
c? —a 1 —2a 
x (r [Lcd ves bone ey) 


Another expression is: 
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where: 


B 


4. We have: 


B (0, ro) 


II 


1— e 2T 


2L [e 7| Fo] 


Under the probability measure Q, r (t) is an Ornstein-Uhlenbeck process: 


dr(t) = 


where: 


It follows that: 


and: 


) dt + c aW (t) 
) dt +o (aW? (t) — dt) 
) 


-E2 [Z] + svar (Z) = 


I 


Finally, we obtain: 


2a? 2 

2 292 

nB- V (Tf) + 7, (7-8 - 27 
2 292 
rof (v 2. 3 8-*7 


2022 
B (0, ro) — exp ( roD (v zs) (T B) vs ) 


9.4.3 The Black model 
1. We have: 


30? F?02C (t, F) + QC (t, F) - rC (t, F) =0 
{ € (T, S (T)) = max (F (T) — K,0) 


2. The Feynman-Kac formula is: 


C(0)—e77 


[max (F (T) — K,0)| Fo] 


We know that F (T) is a log-normal 


random variable: 


F (T) = Fye- $9 T *e(WCr)-W(90) 
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We note I = E [max (F (T) — K,0)| Fo]. We obtain: 
o6 + 
= P (Rd TV — K) $ (u) du 


7 J (Foe 3e T YT — K) o (u) du 
d 


gei f e? V Tug (u) du-x | $ (u) du 
d d 


II 


where: 
d= a In n jov 
We have: 
[ow du = 1- (d) 
° = d$(-d) 
and: 


v2n 
E" lg?T 2 1 -ií(u-c T) 
= e? e 2 du 
[ V2n 
= 222 1 — 5v 
= e? e 2" dv 
d-o yT V20 


Finally, we deduce that: 


C(0) = eT (me? eir (d+ oVT) — Ko (-d)) 


where: 


and: 


1 F 1 
d2 = In £ — <oVT 


. Under the risk-neutral probability measure Q, we have: 
dS (t) = rS (t) dt + cS (t) aW? (t) 
'The price of a future contract on this stock is equal to: 


F(t) - eS (t) 
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Using Ito’s lemma, we deduce that: 

dF(t) = -re "*S(t) dtc e "* dS (t) 
ce" S (t) aW? (t) 
= oF (t) dW? (t) 


I 


We can then apply the Black formula to price an European option on F (t). 


. In this case, the PDE representation becomes: 


no hoic (t, F) + aC (t, F) - r (t) C(t, F) 20 


It follows that the Feynman-Kac formula is: 


T: 
C(0)=E [e T (F (T) — K, 0) zi 
Since r (f) and F (t) are independent, we obtain: 


c() = ie Joea 
= B(0,T)-(Fo® (di) — K® (d2)) 


Fi -E [max (F (T) — K,0)| Fo] 


We deduce that the discount factor e~"” is replaced by the current bond price B (0, T). 


. If r(t) and F (t) are not independent, the stochastic discount exp (- for (s) ds) is 


not independent from the forward price F (T) and we cannot separate the two terms 
in the mathematical expectation. 


. We remind that the price of the zero-coupon bond is given by: 


T 
B(t,T) = E? [e Jae 


: 


The instantaneous forward rate f (t, T) is defined as follows: 


FET) - PEDUT) 


We consider that the numéraire is the bond price B (t, T) and we note Q* the associ- 
ated forward probability measure. 


(a) We have: 


OT T 


Q 


0 B (t, T) Â go s INGOLE 
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where M (t) = exp ( ys r (s) ds) is the spot numéraire. We consider the change 
of numéraire (M (t) — N (t) = B (t, T)) and we obtain: 


i T) " K m F 


NT) 
= -N(tE? [r(T)| Fl 


because N (T) = B (T, T) = 1. Since r (T) = f (T, T), we deduce that: 


8 B (t, T) o 
=B (tT) % F(T, T) F] 
(b) We have: 
f(t,T) = ee 
7 1 OB(t,T) 
B(t,T) OT 
= E% [F(T,T)| Fd 


f (t, T) is then an F;-martingale under the forward probability measure Q*. 
(c) We know that: 


c(0) = E9 [e J 1O% max (GC T) - K,0)| Fo 
= [ri max (7.7) -KOF 


Using the change of numéraire N (t) — B (t, T), we obtain: 


c(0) = E% ip max rer m -Ko | 


B (t, T) EV" [max (f (T, T) — K,0)| Fo] 


Il 


Using the Black model, we deduce that the price of the option is!: 
C (0) = B (0, T) - (ro® (di) — K® (d3)) 


where dı and dz are the two values defined previously. 


9.4.4 Change of numéraire and the Girsanov theorem 
Part one 


1. Using Itó's lemma, we obtain: 
d(X (t) Y (t)) = X (t) dY (t) + Y (t) dX (t) + (dX (t) ,dY (t) 


and: 


1 X. dY() , (dY (0 ,dY (2) 
(=a) x Y3( 


IWe have f (0,0) = ro. 
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2. Let Z (t) be the ratio of X (t) and Y (t): 


We deduce that: 


dZ(t) = X(td (=a) + ro (t) + (ax (t),d (=5)) 
_ aX), (5 ( -3X. (t) (dY (t),dY 25 


Y (0) Y? (6 Y "T 
dY (t) | (àY (5) Y (0) 
(ax o. ys()* rE ) 
dX() X(t) (AY (t) ,dY (t) 
= Fo vangdr0*X0 Ve 
(dX (t) ,dY (t)) 
Y? (t) 
and: 
dZ(t  dX(t dY() , (aY(),daY()) (aX (t), aY (2) 9 
Zi) Xt Yo. Y?*(9 X(0Y() (9.1) 
Part two 


1. The Girsanov theorem states that the change of probability only affects the drift and 
not the diffusion. 


2. We have: 
dS (t) = u(t) S(t) dt+ es (t) S (t) awe (t) 
= ph (t) S (t) dt + os (t) S (t) (4W? (t) — g (t) dt) 
= (u&(t)— g (t) os (t)) S (t) dt + os (t) S (t) aW? (t) 


If follows that: 


Or: 
g (t) — Ws (t) — Hs (t) 
cs (t) 
Using Girsanov's theorem, we deduce that the Radon-Nikodym derivative is equal to: 
dQ* 
Z(t) — 


= sw(f ato aw*(9- 5 f (6 as) 


We know that Z (t) is an F;-martingale and we have: 


dZ (t) 


zi -9(00 Wet 
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3. We have: 


Using Equation (9.1), we have: 
dZ (t) 
Z (t) 


= py (t) dt+ on (t) aW9 (t) — 


(um (t) dt + om (t) aW? (t)) + 
o*, (t) e m (t) dt 
= (uw (t) — uu (t)) dt — om (t) (on (t) — om (0) dt + 

(on (t) — ai )) aW? (t) 

We deduce that: 
g (t) = on (t) - om (t) 
and: 
UN (t) = um (t) + om (t) (on (t) — om (0)) 
. Since g (t) = ow (t) — om (t), it follows that: 


us(t) = ms (t)+9(t) os (t) 


ge 


= ps (t) + a (t) (on (t) — om (t)) 
5. We have: de dues 
($ a à 2) = es (t) on (f) dt 
and: 


) 
9.2) is equivalent to: 


We conclude that Equation 


( 
; dS (t) dN (t) 
n5 ( dt Co xU) 


We also notice that: 


j 
(0. ain - (0.20) 


= He) di em MA 


Ü' "M (t S(t)' Z(t) 
and: 2 : 
sonson (eei 
Part three 


1. Using Equation (9.1), we obtain: 


4$() _ dS() ANO, ANOANO) (AS (H AN (0) 
$() Sq) NG) N? (f) SONO 
= r(t) dt+ øs (t) dw (t) — (r (£) dt + on (t) AWS (1) + 
ow (t) dt — pos (t) on (t) dt 
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(a) We have: 
dN (t) =r (t) N (t) dt 


and: ; 
N (t) — eli r(s)ds 


We deduce that the discounted asset price is: 
S(t) = —— 
cC S r(s) 48 S (t) 


Since oy (t) is equal to zero, it follows that: 


d$ (t 
SS s cs (t) dW8 (t) 
(t) 
We conclude that S(t) is an J;-martingale under the risk-neutral probability 
measure Q. 
(b) We note: 


W° (t) = W$ (t) = Wy (t) 
The Girsanov theorem gives?: 
dw® (t) = aW? (t) — on (t) dt 


and: 
dW? (t) = AW% (t) + oy (t) dt 

We deduce that: 
En = (03, (t) es (t) on (0) at + (es (t) — on (t)) qW? (8) 


= (on (t) — os (t) on (t)) dt + (cs (t) — on (t)) on (t) dt 


where: 
6 (t) = os (t) — on (t) 


(c) Let us introduce the Brownian motion W® (t) such that: 
& (t) aW? (t) = os (t) dW8 (t) — ow (t) dW (t) 


We have: 
8? (1) = 03 (t) — 2pos (t) on (t) + 0% (0) 


We conclude that the risk-neutral dynamics of S (t) is given by: 


Q 
Um 


$m = (ox (t) — pos (t) ON (t)) dt + õ (t) dWw9 (t) 


?In Part two, we have shown that dW9" (t) = dW® (t) — g (t) dt where g (t) = oy (t) — e (t). Here, we 


assume that M (t) = 1, implying that om (t) = 0. 
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We now consider the following decomposition: 


W9 (t) = pw () + VI- PWE (t) 
where Wé (t) L WẸ (t). We deduce that: 


Fay = (oh (t) — pos (t) on (t)) dt + 
(pas (t) — on (t)) WR (t) + os (t) V1 — p? aWg (t 


Since we have: 
dW?' (t) = aWQ (t) — on (t) dt 


we obtain: 
dS (t) 
S (t) 


We notice that: 


= (pos (t) — on (t)) aW9' (t) + os (t) V1 — p2 aWz (t 


& (t) aW9' (t) = (pes (t) — ow (0) aW9' (t) + os (t) V1 — pg? aWz (t 


We deduce that: 


9.4.5 The HJM model and the forward probability measure 
1. Since we have: 


Ju 
N (t) — B(t, T3) = euh f(t,u) du 


we deduce that the Radon-Nikodym derivative is given by: 


dQ^ — MONT) 
dQ — N(T5)N (0) 
= e yj r(t) ac N (13) 
N (0) 


e7 Jy ((— (0,0) dt 


2. We have seen that the dynamics of the instantaneous spot rate is: 


r@=ro+ f (so f «o D ast f (n dW? (s) 


It follows that: 


(0-109 f (otn f otsu) au) ast fos dW? (s) 
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[een [neo 
[evn aw " dt 
J” (wee f'oto au) at) as 
fo Palet i aw? (s) 


Using Fubini’s theorem, we have: 


[ (r(t) — f (0.0) acf 


We remind that: 


a (t, T3) 


Il 
| 
= 
S 

Q 
~ 
= 

A 

S 


and: 


We deduce that: 


Tə Tə T2 
T) — m a 2 E ;42 g 
f (r (t) — f (0,t)) dt n (t, T3) dt f b(t, T5) dW~ (t) 


Finally, we conclude that: 


= 2 dhe a(t, Tə) dt f? b(t,T4) aW (t) 


3. Since the no-arbitrage condition in the HJM model is: 
l2 
a (t, To) + 3° (t, T3) =0 


we obtain: 


dQ* — ule g(t) We(t)—3 fg? (1) at 
dQ 
where: 


g (t) = b (t, To) 


The Girsanov theorem states that: 
t 
WT (y) = W2 (t) — J gs) de 
0 


is a Brownian motion under the forward probability measure Q* (75). We deduce that: 


we) (t) = w(t) — [oor ds 
0 
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4. We have: 
dw’ (72) (t) = aw? (t) — b (t, T5) dt 


It follows that: 
df(t T) = a(t,T,) dt+ c (t, T1) dW® (t) 
= a(t,T) dt 4 o (t, Ti) (awe (t) + b(t, To) at) 


= (a(t,T,) +o (t, T) b (£, T5)) dt + o (t, T1) aW" T> (t) 


Since we have: 


Ti 
a (t, Ti) 4-o (t, Ti) b(t, T9) = oen) f a (t,u) du — 


We conclude that: 


T» 
df (t,3) 2 — (s 6m f o (t,u) D dt + o (t, 3) dw® T (t) 


TY 


5. When T is equal to T1, we have: 
Ti 
i, a(t,u) du- 0 
Ty 


and: 
df (t, T1) = o (t, T1) aWQ9' (9) (t) 


We deduce that f(t,Ti) is a martingale under the forward probability measure 
Q* (71). 


6. (a) Let s € t. We have: 


B (t, T) efi (00 8 Qu) du+b(u,7) aW? (u) 


B (s,T) 

and: 

B(t,13 | . B(s,Tə) eJ. (r(u)— 3b? (u,T2)) du+b(u, T2) dW? (u) 

B (t, Ti) B (s qij eJ. (reo $9 0n) du--b(u,T1) AW9(u) 

- B (s, T2) x(s) 
B (s, Ti) 
where: 
1 E 2 2 
X(st) = £5 (0? (u, T2) — b? (u, T1)) du + 


/ (b (u, T3) — b (u, T1)) aW? (u) 
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(b) We have: . 
dWQ'(T) (t) = dW® (t) — b (t, T) dt 


We deduce that: 
1 $ 
Riley, = zi] (0? (u, 75) — 8? (u,T;)) du + 
t 
/ (b (u, T2) — b (u, T1)) aW" 9 (u) + 


J (b (u, T5) — b (u, T,)) b (u, T1) du 


UN eg 2 
= -5J (b (u, T5) — b(u,Tr))? du 4- 


ri (b (u, T) — b (u, T1)) We 7) (u) 


We notice that e* ) is an exponential martingale: 


We conclude that: 


B (t, Tə) = a B (s, T») X(s,t) 
E (t, T1) F. ao d 
B (s, T3) 
~ B(s T) 


9.4.6 Equivalent martingale measure in the Libor market model 


1. Since we have: 


B (t, T) 
SS 14 (Ti — T;) Lt, T;, T; 
B (t, Tai) ( j+1 j) (t, Jo 5i) 
= 1446;L; (t) 
we obtain: 
HUI ) _ 1 
B(tTj) | 1+6;L; (t) 
It follows that: 
B (t, Tk+1) - B (t, Tk 44) x B (tye) KX B (t, PTUS 
BT) B(t,Tk) | B(t,Th-1) B (t, Tii) 
k 
- II B (t Tj+1) 
Poo B (t, T;) 
A E 
Piso? 1+ ó;L;(t) 


2. We remind that: 
M (t) = B (t, Ti41) 
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and: 


155 


N (t) = B (t, Te+1) 


We have: 


Z(t) 


We deduce that: 


dQ* (Tk+1) 


) 


3. We have: 


L1) 


dlnZ (t) 


I 


( 


j=i+1 


B (0, Th- 


II 


j=i+1 


| 


Hd) 1+ ;L;(t) 


- M dm(1+6;L; (t)) 


d (1 + 9;L; (t)) 


Fl 


1-35; (1) 


6; dL; (t) 


Aj 


1 55L; (f) 


(t) ó;L; (t) awe (Tj41) (t) 


1 


4. We obtain: 


G 


+ ó;L; (t) 


Y (t) OL; (t) 


j=i+1 


+ dj Ly (t) 


i+1 


= (t) >> 


j=i+1 


14 


-0;L; (t) 


5. Under the probability measure Q* (Tj+1), 


dL, (t) 
Li (t) 


Y (t) 7L; (t) 


Y (t) Ly (t) 


(Tj) 
awe («0 (s 
Fenn o 54 o) 
(awe a aw? 9 (e) 


Pij dt 


we have: 


= pi (t) dt + y:i (t) awe C» (y) 
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where u; (t) = 0. In Question 5, Part two, Exercise 9.4.4, we have shown that: 


dL; i * 
p e = uia (£) dt +74 (£) awe (y) 
where: 
Li 
i (O dt = p(t) dt (50 amz (0) 
Li(t) ' 
We deduce that: 
EZ 
pik(t) = 253 pig Vy (t 
Farro )125, 14 ;L;(t) 
"m 0Y EN a 
A um + (Tj41 — Tj) L (t, Tj, Tj41) 
6. If Tk+1 < Ti41, we have: 
B (t, Tk+1) B (t, Tk41) x B (t, Tk42) XX B (t, Ti) 
B (t, Tii) B(t,Tr+2) Bt, Tk+3) B (t, Tii) 
E II B (t, T;) 
j=k+1 B (t, Tj+1) 


j=k+1 
We deduce that: 
B (0, Ti+1) t 
Z(t) = 146; 


It follows that: . 
: AU ;b; (t * (rr. 
dinZ (t) = Y (t) SL; (t) aw? B+ (y 
jára oos) 


and: 


We conclude that: 


=% (t) Pij (t) Sz Ly (t) dt + yi (t) awg (Te) (t) 
j 1+ jL; (t) 
j=k+1 


9.4.7 Displaced diffusion option pricing 
1. We have: 
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We deduce that: 
bS(t) = a(t) + àB(t) X (t) + B (t) u (t, X (t) 


= a(t) + &B(t) ee W- ew) -B(t)u (« BA: a) UM 2j 


2. We have: 


ist) = aat +0.) (AE) + nay (AO) 
= ara (t) - (AO + il «(S25 eu) st 


B (t) 
meaning that: 
ee Oo (t ) - (B® 19.8 (t + 1(t)) a(t) =0 
B(t) B (t) + u(t) = 
We deduce that: 


and: 
a (t) = ape” 
We also have: 0,8 (t) 
t 
be 
AG) u(t) 


and: 


B(t) = Boedo 6-9) ds 
3. We deduce that: 
dS (t) = bS (t) dt + B (t) ^" c (t) (S(t) — a (t))? aW? (t) 
4. We have: 
dX (t) = u (t) X (t) dt + o (t) X (t) aW? (t) 


and: ; ; 
X (e Sel Quo 3e (0) ae [ome 


By noticing that So = o + fo Xo, it follows that: 


a (t) 4- B (t t) Xpedo ” (&(s)- $e? (s)) ds fo a(s) qW?(s) 
Je d (b- ic? (s)) as-- f? a(s) dW°s) 


S (t) 


I 


= age? * (So — ao 


5. The payoff of the European call option is: 
f(S(r) = (s(r- K)" 
= ((S (T)— age”) — (K — aoe?T))" 


It follows that the option price is equal to: 


t 
C (to, So) = Cas (s — ap, K — agePT, J c? (s) ds, T, 2 
0 


157 


(9.3) 
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9.4.8 Dupire local volatility model 


We assume that: 
dS (t) = bS (t) dt + c (t, S (t)) S (t) aW? (t) 


1. Let C (T, K) be the price of the European call option, whose maturity is T' and strike 
is K. We remind that: 


Y^ (T, K) K?02.C (T, K) — bK OKC (T, K) — 
rC (T, K) + (b— r) C (T, K) 20 


We deduce that: 


A'(T,K 
2 EN , 
o (T, K) I B! (T, 
where: 
A' (T, K) 2 2bKôxC (T, K) + 20rC (T, K) — 2 (b — r) C (T, K) 
and: 
B' (T, K) = K°8%C (T, K) 
2. We have: 
C (T, K) = Soet TE (d1) — Ke" (d;) 
where: 
1 So 1 
di = 1 EXPE OP ROT 
i sawek) ) aa 
and dy = dı — X (T, K) VT. We note Cgs (T, K, X) the Black-Scholes formula. We 
have: 
OKC (T,K) = OxCss(T, K, (T, K))+ 
ôr? (T, K) Oz Css (T, K, X (T, K)) 
and: 


OLC(T,K) = Ces (T, K,E (T, K)) + 
20xD (T, K) 8$ «Cs (T, K, X (T, K)) + 
O&X(T,K)0zCsas (T, K, X (T, K)) + 
(8k X (T, K))? &Cps (T, K, X (T, K)) 


The derivative of C (T, K) with respect to the maturity T is equal to: 


OrTC(T, K) = OrCps (T, K,E (T, K)) + 
Oru (T, K) 0g Cps (T, K, X (T, K)) 


The different Black-Scholes derivatives are 


OKC 
sC 


SEG, 
32C 
02 KC 
arC 


We deduce that: 


A! (T,K) 


where: 
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BS (T, K, X) 
BS (T, K, X) 


BS (T, K, X) 


BS (T, K, X) 


BS (T, K, X) 


BS (T, K, X) 


Il 


3. 


= —e "To (d2) 


= Soet 


eT 


eT 


eT 


“IT V T6 (di) = Ke" VT (dz) 
(dz) 
KSVT 
K V/Tó (d2) did» 
23 


dió (dz) 


»» 


= (b-r)Soe^-?Tà (dy) + 
Ke (r (d2) + =e a) 


2/T 


—2bKe-*1 6 (dz) + 2bK7e-" VT 6 (d2) OK (T, K) + 


2(b—r) Soet - ® (di) + 


fhe (ra (d2) + 


X (T, K) ó (d2) 
2/T ) 


2Ke^'T /T$ (dz) Oru (T, K) — 
2 (b — r) (Soe? Po (di) — Ke" (d2)) 
= 2bK?e-'T V/Tó (d2) OkX (T, K) + 


Ke-'T Y (T, K) ¢ (de) 


—rT 


A(T, K) 


We also have: 


BER) 


—rT 


VT 
Kọ (dz) 


X(T,K)VT 


+ 2Ke"! V Tó (da) 8rX (T, K) 


A(T, K) 


X? (T, K) --20KTY (T, K) OkX (T, K) + 


2TY (T, K) 


Kọ (d2) 


ard (T, K) 


-rT K?dió (dz) 


EX(TK)/T STK) 


eT Ke V/T¢ (d 


—rT 


OUT, K) + 
eT K? VT (d2) dido 


(8gX (T, K)? 


X(T,K) 


Kọ (d2) 


£ (T, K) VT 


3We use the fact that: 


Sog (d1) = K 


B(T,K) 


e PT $ (d2) 
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where: 
B(T,K) = 1-2KwVTd;OgX(T, K)+ K?TX (T, K) 82. (T, K) + 
K?Tdd» (0kX (T, K))? (9.5) 
We conclude that: 
/ 
o’ (T,K) = S 
B' (T, K) 
_ A(T, K) 
Cv B(T,K) 


where A(T, K) and B (T, K) are given by Equations (9.4) and (9.5). 


. We follow the proof given by van der Kamp (2009)*. Let f denote the discounted 


payoff function: " 
£055) Set (Sr) eR)" 


Itó's lemma gives: 
df(T,8) = -re T-t) (S — K)* aT - bSe "7-118 > K} dT + 
a^ (T, S) Ste" 6 (5 = K) dT + 
c (T, S) Se 1-91 (8$ > K} aW? (t) 


where 6 (x) is the Dirac delta function. We deduce that: 


: [df (T, S (T))| Fe] 


drC(T,K) = dm 
= re U-ÜKE[1(S(T) > K} F+ 
(b— r)e-"C-?E[S (T) 1 (S (T) > K} FA) + 
je "("-)g [o? (T, S (T)) S? (T) 8 (5 (T) - K)| Fi] 
. We have: 
C(T,K) = E[f(r.s(r)| X] 


e" (T-OE[(S(T) — K)1{S(T) > K}| Fi] 


It follows that: 


OkC(T,K) = -e"C-?E[1(S(T) > K}| Fi] - 
e-'T-DE[(S (T) — K) (S(T) — K)| Fi] 
= —e "(T-t [1 {S (T) > K}| Fi] 


— 


and: 


O&C(T, K) = eC E[8(S (T) - K)| Fi] 


We notice that: 


i [df (T, S (7))| i] 
dT 


arC (T, K) = 


4van DER Kamp, R. (2009), Local Volatility Modeling, Master of Science Dissertation, University of 
Twente. 
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Since we have: 


L[S (T) 1 (S (T) > KY =e"? -%¢ (T, K) + KA(S (T) > K} 


we obtain: 


OrC(T,K) = re" TO KE[1(S(T) > K} F] + 
(b-r)E [e (T, K) +e” T-9K1{S(T) > K)}| | + 


TOGT) 


We also have: 


Q(T,K) = E[c?(r,S(r) S? (T)| S(T) = K] E[6(S(T) - K)| Fd 
= E[o? K] K?e" (7-992. C (T, K) 


Ss 
U 
3 
U 
È 
II 


We conclude that: 


OrC(T,K) = rKôxC(T,K)+(b-—r)C(T,K)- 
(b— r) KOkC (T, K) + 


2 [o° (T, S (T))| S (T) = K] K?8;,C (T, K) 


NI = 


and: 


Lo? (T, K) K?07.C (T, K) — bK OKC (T, K) — 
rC (T, K) + (b-r)C(T,K)=0 


5. (a) Since we have: 


we deduce that: 


and: 


We also notice that: 


dido = 


1 
£2 (T,K)T 4 


(b) The first derivatives of x = ọ (T, K) are equal to: 


1 


and: 
re (T, K) =b 
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It follows that: 


OkE(T,K) = O«X(T,9(T,K)) 
1. 
= -g> (TF, 2) 
and 
OrE(T,K) = drd(T,~(T,K)) 


Ard (T, x) + bd,» (T, x) 


We also have: 


OLX OW 303 (T, x) + RËS (T, x) 


(c) We deduce that: 


where Å (T, £) = A (T, K) and B (T, <) = B (T, K). Using Equations (9.4) and 
(9.5), we obtain: 


A(T,2) = X?^(T,z) 2bTÝ (T, x) ôs% (T, £) + 
2T (T, x) (O73 (T, x) + 50,9 (T, x) 
= (7,2) +2TS (T, x) Ord (T, x) 
and: 
B(T,2) = 1-2 (= (T, £) + PTÉ (T; 2) 8,5 (T, £) + 
TY (T, x) (8,5 (T, x) + 025 (T, 2)) + 
(22 (T, x) — "o (T, »)) (8,5 (T, a)) 
= (1-z$-!(T,z)8,X (T,a)) + 
TÈ (T, 2) 029 (T, x) — 


E (TS (T, x) 4,2 (T,2))- 


ge 


(d) When T is equal to zero, we obtain: 


a? (0, c) ex - X? (0, x) — 5 
(1— zX-! (0,2) 8,3 (0, 2)) 
and: ae 
Š (0,2) = E - aa) č (0,2) 


The explicit solution of this equation is: 


—1 
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We deduce that: 


ie yo? (0, ry) 0,6 (0, xy) dy 


0,3 (0,2) = - 
(hë 6 (0, xy) dy) 
It follows that: 

1 

: d 

8,5(0,0 = Jo VV s (0,0) 

1 

(ic dy) 

= —0,6 (0,0) 


9.4.9 The stochastic normal model 


1. We have?: 
EN(T,K) = Xg(T,K)V/HRKx 
1+ din? Fo/K + Ad; ln* Fo/K 
1+ d (1- d; In? Fo/K) X2, (T, K) T + gd, X (T, K) T? 
2. We have®: 
EN(T,K) = ae(FyK)"? ES x 
x (z) 
1+ di ln? Fo/K + qd; In* Fo/ K 
14+ di (1—- 8) In? Fo/K xd; (1 - B) In* Fo/K 
—6 (2 — b) a? 2 — 3p? 
ad pew UE i m d PT 
24 (FoK) 4 (FyK) 24 
where: F 
A= ud (FaK) ETO i 
Q 
and 


x(z) =In ii ous i 
Ip 


In the sequel, we introduce the notation y(z) = y1 — 2pz + z?. 
3. When is equal to 0, we obtain: 


yr jeg (=) (1+ Avr) 


F 
zz E IK 
a K 


x(z) =In [Tua 
Ic 


where: 


and: 


5Hagan et al. (2002), Equation (A.64) on page 101. 
6Hagan et al. (2002), Equation (A.69) on page 102. 
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4. Since we have: 
lim —~~=1 
K>Fo X (z) 


we deduce the expression of the ATM volatility: 


2— 2 
S.A Se (1+ t zr) 


5. We notice that z is a function of K. By introducing the notation z = z (K), we have: 
—2p0K z(K)--2z(K)O8kz(K) 
2/1-2pR(K)RzR(K) xz (K) 
VA 39 (K) € (K) +2(K) p 
Okz(K) 
VA ps (K) E) 


ükx(z(4) = 


and: 


ðkz(K) = Z JFK (VK (ny - m K)) 


| v In Fo VK hK 
- tun (Be : iJ 


It follows that: 


where: 


We deduce that: 


OkEN(T,K) = o (1 een 7) Bk (a) 


24 z(K)) 
2-3p , Fo 2 
= (1+ 24 2r) EIER) 
1 z 
(9.6) 
(zs suat] 
6. We have: 
C (T, K) = (Fo — K) (d) + on V TO (d) 
where: 
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7. Using the results of Breeden and Litzenberger (1978), we have: 


Q(T,K) = Pr{F(T)< K} 
OC; (T, K) 
— 1 — 
+ aK 
When oy is equal to the function Xy (T, K), we deduce that: 


Il 


Q(T, K) 1 — 9 (d) + (Fo — K)ó(d)-Okd-- 
V TÓ (d) -OkXw (T, K) — Xy (T, K) V Td (d) - Oxd 
1 — (d) + 6 (d) VT - «=n (T, K) 


where Ox Un (T, K) is given by Equation (9.6). 


8. For the density function, we have: 
q(T,K) = -—ó(d)-Okd — dó (d) VT -Okd-OgXw (T, K) + 
6 (d) VT -O&Y (T, K) 
We notice that: 
—Xwy (T, K) VT — (Fo — K) VT - Og =n (T, K) 


EE X2 (T,K)T 
|..— l-«dVT-ÓOkXw(T,K) 
It follows that: 
$ (d) 3 
T,K) = 1 dVT-OgXwN(T,K)) + 
(E) x xy (T, K)) 


(d) VT - 82. Xy (T, K) 


To calculate the probability density function of F(T), we need to calculate 
Ox™=n (T, K) and 02,Xy (T, K). If we use the approximation z = vo !(Fo — K), 
we have Ógz = —va-1. We deduce that: 


_ 9,2 
ükXx (T, K) = —v (1 a 5 = 3p zr) 1 z 
24 x(z) x2(z)4/1-2pz + 2? 
and: 3 j 
2 2 2— 3p" » 
Ox En (T, K) = — (1+ VT) D 
where: 
D = 2 z ys 
X? (z) (1 — 2pz + 2?) V x (z) Vi — 2pz + 2? i 
z(e- p) 
x2 () (1 = 2pz + 22)? 


9. When f = 0, the SABR model becomes: 
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Since we have a (0) = a, we obtain: 


a (f) = ac BHP 


and: 
dF (t) = ae- 3^ ts qw (t) 
It follows that: 


t 
F()=Fyta T eE stuw?) aw (s) 
0 


Using the scaling property, we deduce that: 


vt 
F (t) = Fo + J e7 35*Wa(s) dw, (s) 
0 


where Wj (t) and W> (t) have the same properties WẸ (t) and WẸ (t). 


We note: 

d 1 

X(t) = | e 28+ W2(8) AW, (s) 
0 
Me (t) = e 3etraWa(t) 
M(t) = e 2tt Walt) 
We have: 
dX (t) = e-3** W2@ qw; (t) = M(t) aW; (t) 

and: 


d (X (t)) =e?" dt = M(t)? at 
Using Itó's lemma, we deduce that: 


n (n — 1) 


dX” (t) = nX"(t)dx (t)+ A77 (t) d (X b) 


= nX- (e) M (t) dw; (t) + r= 


Since we have: 


t) = a (t) dt -- aM? (t) dW» (t) 


we obtain: 


A(X (HAMA) = X"(t) aM? (t) 4- M" (t) aX" (t) -a(X" (t), M° (t)) 
= Ed (t) M° (t) dt + aX" (t) M° (t) dW. (t) + 
nX?-1 (t) M° (t) M (t) dW, (t) + 
n (n — 1) 
2 
npa X"! (t) M° (t) M (t) dt 


X?-? (t) M° (t) M (t)? dt 


Model Risk of Exotic Derivatives 


It follows that: 
La(x"(t)M*())) — 
dt 


a (à 


n(n—1 
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— 1) 
2 


[X (t) M* (t)] + 


— 


; E pz ? (£) M* (t) M (0? + 


npa. 
We notice that M* (t) M (t) = M**! (t) 
that: 
dy (t) 
dt 


a(a — 1) 
2 


wre (1) + 


where: 


i [X"—! (t) M? (t) M ()] 


and M“ (t) M (t)? = M**? (t). We conclude 


-1 
n(n ) ynr—2,042 (t) ait npayr— Lari (t) 


2 


yn (t = 


l [X (t) M° (t)] 


Therefore, the relationship between "^^ (t) and the moments of F (t) is: 


a n A n 
(E () - Fo)”] = (Z) w^ (20 
11. For n = 0, we have: 
Y(t) = Ex (t) M° (t)] 
= le Jer 
LI eT zatezna t 
= e32(a-1t 
For n — 1, we have: 
dV (t) a(a — 1) .i 0 
= Wt (t) + pay?! (t 
~ ot (0) + pawn (t 
= ao 1) wha (t) NS pae ^t 0t 
We deduce that: 
yha (t) = Bier] e 244-1) nae? (a--1)s ds 
0 
= pera Dt [e"s] 
= pezua-Ht (e*t = 1) 
For n = 2, we solve the ODE: 
ya (t —1 
d ( ) = a(a ) y2a (t) if’. 2pa o1 (t) i y0a+2 (t) 
dt 2 
—1 
- aa ) w?* (4) + h (f) 
"We remind that the solution of the ODE: 
df (t 
FO — af) +8 
is equal to: 
t 
f(t)-2 e e ?^*8(s) ds 
0 
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where: 


(a+1)(a+2) 
2 t 


h (t) = 2pae t (Ti = 1) +e 


The solution is given by: 


t 
W(t) = S. "r^ 5h (s) ds 
0 


» a(a—1), 2p?a +1 (eer — )- 2(/ at . 
eb (Gselc SEERE 


For n = 3 and a = 0, the ODE becomes: 


dY? (t) 


pi? (t 
qi 3 (t) 


3pe' (e? — 1) 


Il 


The solution is then: 
P?O (t) = p (et — 3e* + 2) 


For n — 4 and a — 0, we obtain: 


dy*? (t) 
E 6W?? (1) 
4p? 4-1 
= 6e! (( E — ) (e*t 1) 2p? (e? — )) 
and: 2 
4+1 
40 (t) = 22 — e — Ap*e* : (Ap? + 1) e +. 12p?e* — Ap? +1 
We deduce that: 
1 
[F-m] = (=) wl (vt) 
V 
= 0 


and: 


i [(F (t) — 8? E OECD 


For the third moment, we obtain: 


[(F@ - Ay 


Il 
ZU 
| 
Set 
w 
bz 
id 
ud o 
Lum3 
X 
N 
~ 
Nl 


Finally, the fourth moment is equal to: 


[n - 8 z (2) wh (20) 
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12. We have: 


N 


o(F(t)) = 


S| R, ES Q 


1 1 
bs Ape ge o cT 
( Tv 3" te" 


1 1 
2 242 448 
t+ -v't -vt 
a ( + 2 vt + 6 V ) 
Using the same approximation method, the skewness coefficient is: 
p (e^ à 3e^t zb 2) 


(e?t mm 1)? 


3pvv/t + Apv?t/t 


i CF (t) 


K 


whereas the expression of the kurtosis is: 


(4p? + 1) e?! — 209? (e°! +1) + (369? — 6) e^t +5 
5 (e^t — 1y 

3+ (7+ 11p?) 2t 

(1+ int e 40442)? 


(F) = 


13. (a) Using the formula given in Question (1), we obtain the following equivalent nor- 
mal volatility: 


K 7% 10% 13% 
Su... 30% 20% 30% 
En (T, K) 2.51389% 1.99667% 3.41753% 


(b) The method of least squares gives o = 0.017573, 8 = 0, v = 1.448791 and 
p = 0.383867. We verify that the fitted smile adjust perfectly the three observed 
volatilities. 

(c) The cumulative distribution function is shown in Figure 9.1. We notice that 
the cdf is not an increasing function when the forward rate is close to zero. 
As a result, the density function takes negative value. We deduce that there is 
arbitrage opportunities. 

(d) Using the formula calculated in Question (8), we obtain Figure 9.2. With the 
approximation y FoK In x ~ Fo — K, the probability density function becomes 
always positive. 

(e) The skewness is equal to 10.43, whereas the kurtosis is equal to 1822.60. These 
values are very high, meaning that the stochastic normal model is far to be 
Gaussian. This result is surprising. However, we can show that the long-term 
probability distribution of F (t) in the SABR model is non-degenerate contrary 
to Black and normal models. For instance, when p is equal to zero, we obtain: 


1 
2712 


F (%0) © Ry + ZN (0,1) 


where Z, is a Gamma random variable with parameter k. 
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1.0rF 
PDF 
50 
40 
os, * 
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FIGURE 9.1: Cumulative distribution function of F (1) 


ol 


0 5 10 15 20 
Forward (in Z) 


FIGURE 9.2: Probability density function of F (1) 


Model Risk of Exotic Derivatives 171 


Remark 1 To find the distribution of F (co), we use the following result® of Donati-Martin 
et al. (2001): Let € (t) = — (ct + cB (t) + N* (t) — N- (t)) where N+ (t) and N- (t) are two 
independent Poisson processes and B (t) is a Brownian motion. Let A (t) = hh exp (£ (s)) ds, 


X (t) = exp (E (t)) [^ exp (—£(s)) ds and T4 denotes an exponential variable of parameter 
a independent of £(t). The law pa of A(T) satisfies pa = o^! £*ua where L denotes 
the infinitesimal generator of the Markov process X (t). Let us consider the special case 
£ (t) 2 c B (t) — ct. We have: 


2 


dX (t) = oX (t) dB (t) + ((S-«)x@+1) dt 


c? j (12,243 
£=((5 c) 1) a gozo? 


We deduce that the density function of A (oo) is equal to: 


0* 0 
fato) u) = T (k) ukt exp (-5) 


where 0 = 2/07 and k = 2c/o°. Therefore, we have: 


law 0 
A(œ) = Z 


where Zy, is the Gamma random variable with parameter k. In the stochastic Gaussian model 
with p — 0, we have: 
{ dF (t) = a(t) F (t)? aW (t) 
do (t) = va (t) aW? (t) 


where we (t) and we (t) are independent. It follows that: 


1 
a (t) = a exp (-77 +vdwe W) 


f 1 
F() = Feta [ exp (- 5) vam (9) aw?) 


We deduce that F (t) BY Fo oW? ((F (t))) where: 


We have: j 
law 
F = — 
(F (o) = 7. 
where 0 = 2/(2v)? = 1/ (277) and k = 2v?/ (2v)? = 1/2. Since we have we (t) aw 
N (0,1) Vt, we conclude that: 


law 1 
F (œ) = Fo + 1 I. 
(69) PEG) V 2v? Zi y» 


8Donati-Martin, C., Ghomrasni, R., and Yor, M. (2001), On Certain Markov Processes Attached to Ex- 
ponential Functionals of Brownian Motion; Applications to Asian Options, Revista Matematica Iberoamer- 
icana, 17(1), pp. 179-193. 
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9.4.10 The quadratic Gaussian model 
1. We know that the bond price: 


B(t,T) = 


Q e JE r(s) 4 
is the solution of the following PDE: 


st (x (t) 2 B (t, T) X (07) dE 


Ox B (t, T)! (a(t) + B (t) X (0) 
8,B (tT) — r (t) B (t, T) 


with B (T, T) — 1. 
. We assume that the solution of B (t, T) has the following form: 


B(t,T) = exp (-à(t,T) - 8,7) X (0 - X (0 f (5T) X (9) 


where Î (t, T) is a symmetric matrix. We obtain: 


O,B (t, T) 


BET 7 OB (t, T) — AÊ T) XH- X4 Ore X 00 
and: 
ƏxB(t T) 4} a 
Bay = zu T) — 2Î (t, PX) 
We deduce that: 
82. B (t, T) Ox (oxB (t, T)") 
BT) ` B (t, T) 
= -oP(tT)- 
A ^ EE 
(8 (t, T) + 20 (t,T) X ©) (ô (t, T) + 2P (t,T) X ©) 
= Ô (t, T) + ÊT) ÊT)" + 
Ô (T) X (9B T) + 
280 T) X (Y Î (t, T) + 
AP (t, T) X (t) X (t) P (t, T) 
By using the matrix property tr (AB) = tr (BA) if the product BA makes a sense, we 


can write Equation (9.7) as follows: 


-tr (ERE, T) E (8) 


P. 
Ps 


str (5) AT) BUT) zo 
HXO te T)E()x (0 BLT) 
2x () ETE ()x (0 HT) X ()— 
(Bur) +22) XW) (al) + B(0X () 
à (tT) - X(t)" àB(,T)- X(t)" à a uk (t) 
(e) - X (0' 8(0- X X (t 


) = 


(9.7) 
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We regroup the terms by the polynomial degree in X. For degree 0, we obtain: 


-tr (SPELT) EO") + 


Le (3(1) B(t,7)B (7) B()") - 
ÊT) a(t) -à&(t,T) -o(t) = 0 
Bât T) = -tr (EEO HT) -ET a(t) + 
ETT B()B()" B(,T) - alt) 


or: 


àB(tT) = -—B(t)' B(t,T) - 2€ (t, 7) x (0 X (0! Â, T) — 
2f (t, T) a (t) — B (t) 


For degree 2, we obtain: 


PTE (HE) Pu T) -3P6 T) B (t) - 


àfb(tT)-r(t = 0 
or: 
àb(ur) = sb (5T)x()x(0 P(nT)- 
ar (t, T) B (t) -T (t) 


. B (t) must be a diagonal matrix in order to ensure that Î (t, T) is a symmetric matrix. 
Indeed, if we do not consider this hypothesis, we obtain: 


aÊ (t, T) = ; (f (t, T) +P (t, T)') ODON (f (t, T) +P (t, n" = 


(Pe, T) - P (,7)") B(5 - T (0 
It follows that the term (f (t, T)+Î (t, 2 B (t) is not symmetric. 
. We recall that: 
dX (t) = (a (t) + B (t) X (t)) dt + X (t) aw9' C? (y) 


where: 


and: 
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It follows that: 


z x t z 
X(t) = POX + eh | e B(9)4 (s) ds + 
0 
t 
ge e- 29) (s) dw") (s) 
0 


We conclude that X (t) is Gaussian under the forward probability measure Q* (T): 
X (t) ~ N (m (0,t) , V (0,4) 
We have: , 
m (0,t) — eP x, + i e Bg (s) ds 
that is the solution of the following EDO: f 
ðm (0, t) à (t) + B(t)m(0,t) 
a(t) -E (E) E(t)" b(t.) + 
B(t)m(0,t) — 2X (t) X (t) Î (t, T) m (0,1) 


I 


I 


We also have: 


Or: 


m z t 2 = 
e BOY (0,t) e750" =} e-PGY (s) E (s)'e B(s)! gg 


It follows that: 


-B (t) e- BO vy (0,t) e- BO" 4. 
e PO, (0,t) e- BO" — 
e BOY (0,2) B(t)! e-B0' = e-BOy (gy)! e-807 
Or: 
aV (0,5) = B(QV(00)-V(0,) Bt)! -xa)x(o. 


B (t) V (0,t) + V (0, t) B(t)! — 
AV (0D T GT) SG) EG) HEHEA! 


In our approach, the dynamics of m (0,t) and V (0, t) are obtained under the forward 
probability measure Q* (T). In the paper of El Karoui et al. (1992a), the dynamics of 
m (t, T) and V (t, T) are obtained under the probability measure Q* (t, T): 


Opm(t,T) = a(T)+B(T)m(t,T) —2V (t, T)T (T) m(t, T) — 
V(t, T) B(T) 
rV (t,T) = V(t,T) B(T)' - B(T)V (&T)— 


VETO) (P) x ey 0. 


. The Libor rate L (t, T; 1, T;) at time t between the dates T; ; and T; is defined by: 


1 [BUT 
Least) = go (Bem) 


where 6;-1 = T; — Tj... 
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. The payoff of the caplet is given by: 


f(X) = SL nan) -X) 
B (t, T; i) T 
= Gon ~(1+ 6.440) 


= DIOE (B (t, i1) - (1+ 61K) B (tT) 


It follows that the price of the caplet is given by: 


— Ti p(s) ds 
Caplet = ue aac ~(14+6_.K)B,T))t 
aplet J BET) (B (t, Ti-1) — (1 + ĝĉi—1 K) B (t, T;)) 


= B(0, 4E” O (B, Ti) - 1-6: K) BT)" | 
= B(0,t) E? ® [max (0, g (X))] 


where: 
g(x) = exp (—â (t, 7-1) - Ê (t, Ti-1) 2 - Î (t, Ti-1) 2?) = 
(1 + 5;-1K) exp (-a (t,T;) — B (t, T) x — Î (t, T) 2?) 


. We have: 


II 


Caplet B (0,¢t) |. f (x) à (z; m (0,0) , V (0,2)) dz 


oo 


= B(0,t) S RCO CAO r Ood 


oo 


= B (0,0 | g0) 4 (2; m (0.0 V (0.0) dz 
= B (0.1) f h(x) dz 


where E = {x : g(x) > 0} is the exercise domain of the option and: 
h(x) = g (x) 6 (x; m (0,1) , V (0,1) 
We note a; = Î (t, T;), bi = B (t, T;), ci = â (t, T;) and d = 1 + 6; 4 K. It follows that: 


g(z)>0 e exp ( a; ir? — b; im ci-1) > dexp (-aja? — biz — ci) 
& a;ar?-cb; qrcca €au-4465-4c;—1nd 
e ar+br+c>0 
where a — Qi — Qi—1, b= bi = bia, C = Cj — Cj] — Ind. Let A = p? — 4ac be the 


discriminant of the quadratic polynomial. If A < 0 and a > 0, E = (—oo, +00). If 
A<Oanda<0,€=0.If A>0 and a> 0, € = (—00, x4] U [z3, +00) where: 


—b — Vb? — Aac 
2a 


—b + Vb? — 4ac 
2a 


Xi = 
and: 
T = 


If A >Oanda <0, E = [21, x9]. 
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8. We have: 


1 
P(x) = 2V (£ — m)? — ax? — br — c 
1 2 
= x” 4 ee a az? — ba — c 


1 2 m m? 
= -|55 b 
PEDE «(T ja (r +e) 


1/1+2aV\ 4. (m-—bV m? 4- 2cV 
Lope RE x 
2 V V 2V 


We can write P (x) as follows: 


1 bj 2 
P(xy— a + Zr- 8 
2V V 2V 
1 
Boe (x—m)-e 
where: 
E V 
V — 
1+2aV 
E m — bV 
"e = Goa 
- m? 4- 2cV m? 
G = > 
2V 2V 
We deduce that: 
T2 
J -— J l en 3v (0-9 -6 dge 
zı vV2nV 


1 m 
V 1x 2aV* ( 


We also have: 


and: 


[er 
I 
Lem 
5, 
+ 
N 
eo 
X 
NS 
à 


m?(142aV) | (m—bVy 
2V (14-2aV) 2V (1+ 2aV) 
2am? + 2bm — P? V 
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9. The price of the caplet is equal to: 


Caplet = B(0, Df h(a) da 
E 
= B(0,t)Ti-1 (E) - B (0,t) (1 + 6:-1K) T; (E) 
where: 


L; (€) = | e- &(6T)- B(T)a- FTO 6 (v; m (0, t) , V (0,t)) dz 
€ 


Since we can write Z; (£) in terms of 7, we obtain an analytical formula of the caplet 
price. For instance, if E = (—00, x1] U [z2, +00), we have: 


Tie = J (a(t, Te) B (6) P (5,7) m(0,5 V (0,5 ,—09,21) + 
I (& (t, Ti), 8 (7) F(T) m (0,5 , V (0,1) 42, +00) 


If E = [x1, x9], Ti (£) becomes: 


L (E) =F (a(t T) (7) LT) m (0,0 V (0,0 21,22) 


9.4.11 Pricing two-asset basket options 


2: 


1. Let f (Sı (T), S2(T)) = (o491(T) + a282 (T) — K)* be the payoff of the option. 
Using Feynman-Kac representation, we know that: 


Co = E9 r So re pg, (T), S2 | 


where: 
Si (T) = $1 (0) e(bi 881 )Tm V Tei 
So (T) m So (0) e(b2— 302) T+o2VTes 


and (€1,€2) is a standardized Gaussian random vector with p(€1,¢2) = p. Since the 
probability density function of (€1,€2) is equal to: 


1 — zr dom (#1 +23-2p0122) 
h (x1, £2) = —— eicere 
(21,22) 2m4/1— p? 
we have: 


Co = e77 n g (£1, £2) h (£1, £2) dz; dz2 
R2 


where: 
g (21,22) = (g1 (z1) + g2 (22) - K)* 
gi (a1) Es, a1 $1 (0) (b1-407)Tt+oiVTx1 
ga (12) = a282 (0) (02-373) Tem Tos 


(a) Since we have: 


1 
Ac _D>0ex>-n—-—- 
c 


2.| D 
ole 
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we deduce that?: 


:|(Ae** - D)*| 


Il 
y 
— 
D 
® 
g 
+ 
Q 
8 
iw 
— 
+ 
© 
oN, 
8 
— 
(em 
8 


where: 


~ 
c 
wa 


If A « 0 and D > 0, we have: 


and: 


s [ (Actes es D)'] —0 
If A « 0 and D « 0, we have: 
(Aerts — p)* = (-D + Aet) t 


and: 


) [Aet z D)'] = —D® (-d, — c) + Ae^*à* ð (—d,) 
If A » 0 and D < 0, we have: 


+ 


(Ae? + _ D) = Acbtes -D 


and: 


J [aet — D)'| = Agi p 
3. Using the Cholesky decomposition, we have: 


E2 = p£1 + V1 — p?ea 


where (€1, €3) is a standardized Gaussian random vector with p (£1,£3) = 0. We deduce 
that the pdf of (1,63) is given by: 


9We recall that: 


Model Risk of Exotic Derivatives 179 


Therefore, we have: 


Co = eT ni g' (£1, 23) h (£1, £3) dry das 
R2 


where: " 
g' (21,23) = (gi (£1) + g5 (21, 23) — K) 
gi (21) = aS; (0) e he Tem VT 


, 
1 € 
gl (21,23) = a282 (0) e228) Tori Tr I Pros 


It follows that: 
m Jes (f (95 (z1,23) — (K — gl (z1)))* 6 (23) dza) (21) day 
where: 
92 (21,23) = (oss; (0) euam) x 
c (02 402033 (17 2?)o3)T-- /1- Pos Tus 
Since we have $5 > 0 and K* > 0, we deduce that: 
Cy = [ BS(S*, I o Tr) (21) dey 


where: 
S* = a3S$» (0) eP92 V Tz 
K*= K- 01.91 (0) e(i- 30i )Tr V Te 
o* = oy 1 — p° 
b* = bo — ip?o2 
In this case, the Black-Scholes formula is equal to: 
BS (S*, K*,o*, T, b*,r) = Ste -To (dj) — K*e-'T E (da) 


where: 


and: — 
d» = di = o* VT 


. Ifa, > 0, ag < 0 and K > 0, we obtain the same formula: 
Co = i BS (S*, K*,o*, T,b^,r) b(a1) day 
R 


with: 
BS (S, K,c, T,b,r) = —S*e(*' T$ (—d,) + K*e-'1 ® (—d;) 


. In the general case, we can obtain the following options: 


S* k*j* ( 
m d (put) 
[S7] e ( 
0 (0 


where S* > 0 and K* > 0. Table 9.1 shows that we cannot always transform the 
two-dimensional integral into a one-dimensional integral. 


2 | ( 
2 | ( 


[ası (T) + agS2 (T) — K)* = 


TABLE 9.1: Pricing basket options with one-dimensional integration 


Case a, ag K Type 9^ K* 1D 
#1 S ST E a (call) 0161 (T) xp a2 S2 (T) K 

#2 EE T m (e) 0194 (T) T 0295 (T) —K "4 
#3 + -— + (call) 0154 (T) K-a (T) v 
#4 + -— -— (call) ası (T)- K —a26$» (T) 

#5 — + + (cal) 2255 (T) K-a (T) v 
#6 = T — (call) a2 Sg (T) —K —o164 (T) 

#7 — — + (0) "4 


#8 — -— -— (put) o494(T) -- o5S5 (T) K 
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Statistical Inference and Model Estimation 


10.3.1 Probability distribution of the t-statistic in the case of the linear 
regression model 


—1 


2 
1. We verify that HT = (X(X'X) ! X") -X(X'X) XT =H and: 


Ix 


H? = X(X'X) X'X(X'X) 
= X(X'X) X 


Since I, is symetric, we also deduce that L = I, — H is symetric and idempotent: 


I? = (L-H)(L.-H) 
= L-2H4H? 
= I,-2H+H 
= L-H 
2. We have: i 
LX = (In -X (XX) X')X-X-X-0 
and: 


X'L-(L'X) -(LX) =0 


We notice that: X E 
=X$=X(X'X) X'v-HY 


and: 
U-Y-Y-Y-HY-LY 


We deduce that: 
Ô -LY-L(X6--U)-LXf-4LU-LU 
3. We have: 
trace(L) - trace (In -x -X(X'X) a ) x7) 
= trace(L, ) - trace (X (X'X) x") 
= trace (In) — trace (QC x) : XTX) 


= trace (I,,) — trace (Ix) 
= n—K 


We know that the rank of an idempotent matrix is equal to its trace. We deduce that 
rank (L) = trace (L) =n — K. 
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4. 


Or 


6. 
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We have: : XM 
RSS (8) = ÔTÔ = (LU)! (LU) = U'L/LU = UTLU 
It follows that: 


[Rss (ô)| = E[UTLU] 
= E [trace (U'LU)| 
= E [trace (Lu'v)| 


= trace ( [LuTU]) 


= trace (LE [U' U]) 
= o?trace(L) 
= (n-—K) o? 
and: f 
RSS (3) 
[27] = E i 
n—K 
. We have: 


UTLU = c*((cn)" u) L (er) U) 
= oV'LV 
Since V! LV is a normalized Gaussian quadratic form, we have: 
V'LV ~ x2 
because v = rank L = n — K. We deduce that: 


ê? = 


We have: 


cov(3,0) = E G 8) (6 -0)"] 
( 


:[ (6 (XTX) ' XU) (LU)"] 


= BE[U| L' +(X™X)'xXTE[UUT]L 
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7. We deduce that Ê and 6? are independent, because 62 is a function of Ô. Moreover, 
we have: 


and: 


It follows that: 


10.3.2 Linear regression without a constant 


1. We have: 
yı 1,1 Ti K 
Y : , x= : 
Un Tn Tn. K 
By £1 
p= |: |, =|: 
Bn En 


where £ ~ N (0,07I,,). The sum of squared residuals e! e is! : 
ele = (Y—X8)' (Y - X0) 
= Y'Y-g8'X'v Y'X848'X'Xg 
= xy Yy-oe8 X Ww RBU'X XA 


It follows that: 
B = argmine'e 
1 
= argmin 5 8' (X'X)8- 8! (X'v) 


B is the solution of a QP problem with Q = X' X and R= X' Y. 


lWe have Y! X8 = (v'x8)' = 8 X! Y because Y' Xf is a scalar. 
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2. (a) We consider that there is a constant in the explanatory variables and we note 
X= ( 1 X, ) where X, is the matrix of exogenous data without the constant. 
We write the coefficient 6 as follows: 


_ (Bo ) 
p= (7 
The first-order condition of the previous optimization problem is QB = Ror 


X'XÓ — X" Y. We deduce that: 


1'15,4-1'X,À, = 1'Y 
Xlifo-XIX,B, = XY 


If the residuals are centered, we must verify that 1'é = 0 or 1! (v — Bo1-B.X.) = 
0. We have? : 


i'(v-A1-Ax) = 
= 1[18)+1'X,.8,- 1781-1" BX, 
= trace (17140) + trace (1 x.À.) — 


TY — 1761-17 5,X, 


mn 


= trace (17 501) + trace (1 .x.) a 


= 1'ĝ1+1'ĝ, X,- 1'ĝ1-—1'ĝ,X, 


Adding a constant in the explanatory variables allows to center the residuals. If 
there is no intercept in the linear model, there is no reason that the residuals are 
centered. 


To center the residuals, we must add the constraint 1'e = 0. We have 1'e = 
1! Y — 1' X$, which implies that 1! X8 = 1! Y. The QP problem becomes: 


P ug. ^ 
S 


^ 1 
B = argmin zu (x'x)g- 8! (X'Y) 
st. (1'X)8- (1' Y) 
We obtain a new QP problem with Q = X' X, R= X'Y, A=1'Xet B= 
1'Y. 
'To transform the implicit constraints, we consider the explicit parametrization: 
B=Cy+D 


where C is the orthonormal basis for the kernel of the matrix A = 1'X and D 
is defined as follows: 


— 
e 
— 


D = (A'Ay'A'B 
(x'11'X)x'iu'v 


?We use the following properties: 
e trace(a) — a if a is a scalar; 
e trace(AB) = trace (BA) if the matrix multiplication BA is defined. 
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where (AT A) * is the Moore-Penrose inverse of the matrix A' A. As the dimension 
of B is K x1, the matrices C, and D have the following dimensions K x (K — 1), 
(K — 1) x land K x 1. The objective function becomes then: 

f(8 = 38'(X'X)8-6' (X'Y) 


(Cy -- D)! (XTX) (Cy 4 D) - (Cy + D)! (XTY) 


NIe w]e 


1 1 F en 
= 57 CTX XCy + ;DX XC t 57 CUX TX 


1 
D'X'XD-4'C'X'Y-D'X'Y 


= a (CTX'XC)y+7!' (C'X'XD-C'X'Y)+ 
(Go"xxp - XY) 


We deduce that: 


4-(CTX'XO) CTX! (Y - XD) 


and: 


B-C(CTX"XO) C'X"(Y-XD)4 D 


'The analytical solution consists in computing C, D and finally B. 


10.3.3 Linear regression with linear constraints 
1. (a) We have: 
RSS(8g = U'U 
= (Y¥-Xs)' (Y - X9) 
= (Y'-g8'X")(v -Xg) 
= cy yoga kK" y—y'xep+e x xe 
= pg'X'xp-og'x'v.v'v 


(b) The first-order condition is: 
ð RSS (£) 
0B 


—-2X'xg-2'Xx'v-0 


We deduce that: " ES 
B-(X'X) X'Y 
(c) We have: 


8 = (X'X) X'v 
(X'X) X (X8 4- U) 
= 84(X'X) XTU 


I 


Since X | U, B is an unbiased estimator of 6: 
1 


8| = Efe + (XTX) xT] 


= 2+ (XTX) E[X"U] 
= $ 
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and the variance of B is: 


s) = «[6-36-37] 


= o (X'X 
2. (a) We have: 
B =  argminRSS (£) 
AB=B 
s.t. { CB>D 
We deduce that: 
B = argmin : 8' 2X'x)8— 8' (2X' Y) 
Aß =B 
s.t. { CB>D 


We obtain a QP program with Q = 2X'XK and R= 2X ! Y. 
(b) We obtain ĝi = —1.01, Ê> = 0.95, Ês = 2.04, By = 3.10 and 8; = —0.08. 
i. If S Bi = 1, we have: 
A=(1 1 1 1 1) and BSA 


We obtain f, = —2.40, 62 = 1.08, 83 = 0.49, B4 = 2.43 and fs = —0.60. 
ii. If £1 = Bo = Bs, we have: 


1-100 0 0 
4-(3 60 9631 us »-(1) 


We obtain £4, = f; = Bs = —0.08, 83 = 2.22 and £4 = 3.17. 
iii. If By > Bo > B3 > Ba > Bs, we have: 


1 -1 0 0 


We obtain £4 = 1.33, f = 1.33, £3 = 1.33, 84 = 1.33 and f; = —0.23. 
iv. If 84 < Bo < B3 < B4 € Bs and S B; — 1, we have: 


A=(1 1111), B=1, 
and: 
sp. is Hh. nd 0 
0-1 1 00 0 
EE EXE I E 
0 0 0-11 0 
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We obtain 6, = —2.63, 82 = 0.91, 63 = 0.91, 84 = 0.91 and Bs = 0.91. The 
first-order condition of the QP program is: 


QB—R—A'A44-C'Ae 20 


where A4 is the Lagrange coefficient associated to the equality constraint and 
Ac is a vector of dimension 4 x 1 corresponding to the Lagrange coefficient 
associated to inequality constraints. Moreover, they verify the Kuhn- Tucker 


conditions: : 
min (Ac, Ce - D) =0 


Since A4 = —192.36304, we have: 


0.0000 
3.7244 
C'àc = -(QU—R-A'A4)— —2.8742 
24.7449 
—25.5951 
We deduce that: 
0.0000 
cs 3.7244 
E 0.8501 
25.5951 


(a) We have: 
1 
f (B; X) = 58" (XTX) 8 — 8 (XTY) - A (A8 - B) 
The first-order condition is: 


M -(X'X)8- (X'Y)-A'A-0 


We have then: 7 i i 
B=(X'X) (X'Y)-(X'X) A'A 


Since AG = B, we have: 
A(X'X) (X'Y)-A(X'X) ATA=B 


Or: 


\= (A (XTX) 47) i (A(X"X) (XTY) - B) 
We deduce that: 


8 = (XTX) (xTY)- 


(XTX) AT (AQ x) AT) (age) (KTY) - 8) 


8-(X"x) "AT (Ao(X"X) SAP (48 - B) 


— 
c 
we 


To transform the explicit constraints into implicit constraints, we consider the 
parametrization: 


B=Cy+D 
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where C is the orthonormal basis associated to the kernel of A and D = 
(AT A)* AT B where (A A) is the Moore-Penrose inverse of A' A. The objective 
function becomes: 


RSS(8) = 287 (X'X)8- 8! (X'v) 
- ; (C4 + D)! (XTX) (Cy 4 D) - (Cy + D)! (XTY) 
= io C'X'XCy+D'X' XCq) + 
;Q C'X'XD+D'X'XD)- 


y'C'X'Y-D'X'Y 
E Y (C'X'XO)6- 4 (C'X'XD-C'X'Y)« 


1 
(s5 x'xp E XY) 


Therefore we deduce that: 


4-(C'X'XC) C'X'(Y- XD) 


and: 


B-C(CTX'XO) C'X'(Y-XD)4D 


'The expression of the estimator under explicit constraints is: 


8 = B-(X'X) AT (A(X'X) ' aT) (A8- B) 


Boc (1-4 (A(X"X) AT) AQUxX)!) : 


= (X 


(XTY) + (KTX) ` 47 (A (XTX) AT) B 


whereas the expression of the estimator under implicit constraints is: 


B. cm X XO) (CURT (Y XD 
= C(C'X'XO) C'(X'Y)« 
(1-c(C7X'xc) *CTXTX) D 


We also have AC = 0 and D = (AT A)* A'B. For any positive definite matrix 
M, we have: 


M^! (1— AT (AM-147) AM) =C (CTMC) ‘CT 


and: 


(XTX) "AT (A(X"X) "a = 


(1- c (C7X"x0) ' c"X"X) (ATAY A 


We deduce that the two estimators are equivalent. 
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(d) If 6; = b2 and 6; = Bs + 1, we have: 


pod oue. 0 0 
4-(3 0 0 0 a) and 5-(1) 


We deduce that the estimator under explicit constraints is: 


B = (xx) (1-47 (a(xTX) an) ATK) 


(XTY) + (7X) " 47 (A(KTX) AT) B 


0.28040 

0.28040 

€ 2.08942 
3.21265 
—0.71960 


We can write the explicit constraints into implicit constraints: 


B = Cy+D 
1 
0 0 Jm 3 
00 + i 
V3 Yı 3 
= 1 0 0 yo |+ 0 
0 1 0 3 : 
1 —2 
0 0 d 3 
We deduce that: 
= O RO QUY XTX) 
2.08942 
= 3.21265 
—0.09168 


We obtain the same solution: 


B = C+D 
0.28040 
0.28040 
= 2.08942 
3.21265 
—0.71960 


Remark 2 The matrices C and D of the previous B = Cy + D correspond to the orthonor- 
mal matrix of A and the matriz (ATA) ATB. However, there exist many decomposition 
B — Cy+D because the only restriction is that C is an orthogonal matriz of A. For instance, 
if we choose: 


B = Cy+D 
1 0 0 0 
100 y 0 
= |010 y |+| 0 
001 ^3 0 
100 —1 
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we obtain: 
4 = (C'X'XO) C'(X'Y-X'XD) 
0.28040 
= 2.08942 
3.21265 
and: 
B = Cy+D 
0.28040 
0.28040 
= 2.08942 
3.21265 
—0.71960 


10.3.4 Maximum likelihood estimation of the Poisson distribution 


1. We have: 
L(A) = 
It follows that: 
O£(X) 
QA 
2. We have: 
We deduce that: 
T 


Sun [Yi = yi} 


i=l 


The variance based on the Information matrix is then: 


3| 


var (1) = 


If we use the Hessian matrix, we obtain: 


var ( 


We obtain the same expression. 


M 


n 
ja Yi 


À 


)=5 
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10.3.5 Maximum likelihood estimation of the Exponential distribution 


1. We have: 


L(A) = Se er? 
{=l 


n 


= nlnÀ- AM yi 


It follows that: 


OX = 
4 n 1 
e À —= T — X 
2i Vi y 
2. We have: 
PLA — (On 
OM X% 
We deduce that: " 
10)- 3 
The variance based on the Information matrix is then: 
82 
var (4) = 2d 
n 


It is equal to the variance based on the Hessian matrix. 


10.3.6 Relationship between the linear regression, the maximum likeli- 
hood method and the method of moments 


1. We have: 
n 2 
= n n 2 1 Yi — a; B 
£(0) — 5 In 2n 5 ine pa " ) 
z 
E 7 in Qn "ine? eae) ee) 
2 2 2g? 
The vector of parameters @ is: 
0= B 
c 
2. It follows that: 
O£(0) 2X' (Y — X) 
OB 2g? 
X'Y-X'Xg 
0. 
and: 
0£() — n , (Y-X8) (Y - X$) 


Oc? 20? 20% 
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We deduce that: 


0 £ (0) - MT 
EU Ge X'Y-Xx'xB- 
0B 0 e B=0 
e B-(X'X) X'Y 
and: 
Xi 
0:0), uos ,(¥-%8) (¥-x8) | 
do? 262 ' 261 
AT 
ya x5) (v-x6) 
TL 
We verify that Bur, = Bors and 625, < 02, because: 
AIR A 
f (Y e xô) (v = xô) 
COLS "NNUS 
3. We have: 
0? £ (0) X'X 
opoaopt c? 
840  -X"'(Y-Xp) 
OBI? c^ 
X'U 
aL) n (Y-X68) (Y-X9) 
00200? 204 c6 
| n U'U 
|». 92g^ c9 
It follows that: 
H(6) = —X'X/o? —X!U/o+ 
~ \ =XTU/o4*  n/ (204) —-UTU/o® 
and: 
1(6) = -EJH (6) 
X'X/c? z [XTU] /e* 
i [XTU] /o* E[U'U] /o° — 3/04 
|. ( X'Xj? 0 
B 0 2 Jot 


because we have E |X! U] = 0 and E[U' U] = E [377 u?] = no?. We deduce that: 


var(0) = I(0)! 


_ Gols 0 ) 


0 26^ n 
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Finally, we obtain: 


and: 


We notice that the expressions of var (Bui) and var (Bors) are similar, but they do 
not use the same standard deviation of residuals 6. 


10.3.7 The Gaussian mixture model 
1. We can write Y as follows: 
Y = BY + (1 - B) Yz 


where B is a Bernoulli random variable independent from Yı and Ys, and whose 
parameter is mı. We have: 


a[y*] = : [avi + a - B) ¥2)*| 
k 
"OP. k bb — i 
- I: (i) on (a - B) Y») 


Il 
Me 
oN 
LE 


: [BY (a - B) Ya) 
i—0 


Since Y; and Y2 are independent, we have E [uv ((1— B) Ya)'| when i Æ 0 or 
i Æ k. It follows that: 


: [Y^] E : [BP] + ia = BY vf] 


= E[B*]E[Y*] + :ja - 5» [v7] 
= mE[Y^] + mE [V7] 


because B is independent from Y; and Yo, B^ ~ B (mı) and (1 — B) ~ B (73). 
2. We deduce that: 


z [Y] = T1 z [Y1] + 12 i [Y2] 
= mipi Tala 


and: 


var (Y) = E[Y?] - E? [Y] 
= mE[Y?] + mE [Y7] - E? [Y] 


Since we know that E [Y?] = u? + o2, we obtain: 


var(Y) = m (ui +0?) + m2 (ug +03) — (mui + ohio)” 


7101 T T203 TTI (1 = 71) (ui + n3 = 2771 72 H1 H2 


II 


2 2 2 
= T10{ + T205 + m7 (pi — ua) 


because 79 = 1 — 71. 
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3. We remind that E [Y?] = u3 + 3:07. It follows that: 


ir - ED?) = E[v?]-sE(v]wr(v) - E? Y] 


= om (ui 3uioi) + T2 (u3 + 3u203) — 
3 (714 72412) (moi + T202 + mT (u1 — ua") - 
(mida + mop2)° 

= omm (T2 — T1) (ui — p2)? + 3r (pi — ua) (01 — 03) 


We deduce that the skewness coefficient is equal to: 


T172 (m —m) (ui — ft)” +3 (u — ua) (0? — ei)) 
yı (Y) = , 3/2 
(mo? + 7302 + 1172 (u1 — H2) ) 


10.3.8 Parameter estimation of diffusion processes 


1. The solution is: 
X (€) = X (s) e 12) -9s wo - we) 


It follows that: 


In X (t) — ln X (s) = (» 50°) (t — s) +0 (W (t) — W (s)) 


Since W (t) — W (s) ~ N (0,t — s), we deduce that the log-likelihood function of the 
sample X is: 


T 

1 E? 
£(u,c) = Ta X (mrm (c? Ati) + i) 
i=1 $ 
T T 
T T 1 1 e? 
= ln2 Ino? J In At; — = J - 

MM Mes n 2 £5 gà At, 


where At; = t; — t;_1 and ej is the innovation process: 
l » 
E&i = ln z; — ln z;:i1 — H- 39 At; 


2. The solution is: 


t 
X (t) 2 X (s)e-**-9 +b (1 — pu + of e 2") AW (u) 


S 


where: 
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We deduce that: 


T 2 
£(a,b,c) = -39 (nze«m(£ 0-799) - 


i=1 
fa 2ae? 
2 = o2 (1 —2aAt ) 
T c? 1 T —2aAt 
= 5 nan yos jouer ) 
ae? 
c? (1 — e-«At;) 
where: 
& — im é At: p ase) 
. We have: 


X (t) — X (s) e u (s, X (s)) (t— s) + o (s, X (s)) (W (t) - W (s)) 


We deduce that: 


T 
1 7 
22. (1 m+ In (e? (ti-1, 4-1) Ati) + = Gam) 


i—l 


where:e; = zi — £i—1 — H (Ei 1, 24-1) Ati. In the case of the CIR process, we obtain: 


T 
T T 1 
L(a,b,o) = 5 ln 27 5 In c? 5 2 (zi 3 At) 
T 
1 e? 
: 10.1 
2 3 0?zj 1 At; ( : ) 


where: 

Ei = Ti — Ti—-1 — 4A (b — Xi-1) At; 
We assume that X (s,t) = X (t) | X (s) is normally-distributed M (m4 (s,t) , ma (s, t)) 
where m; (s,t) = E[X (s,t)] and mz (s,t) = E es (s,t) — m (s, 1))°| .Then, we have: 


T 2 
1 = ti—1,ti 
£ (0) > (sae + amete- 1,5) 4 Gr o s) 


ma (t; 1,1) 
In the case of the CIR process, we have: 


mai (f 1, ti) = z,.1e 9^5 +b (1 = Bees) 


—a4Atij ét —2aAt; ES —aAt, 2 
ma (ti-i; ti) = o? (s- G - , a ie | | 


and: 
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We deduce that: 


2 


T T 
L(a,b,0) = z In?" 5 no 


T —aAti —2aAt; —aAt;)2 
1 (e i—e i) l (1 —e +) 
2 > 4 ln (s- " T b 2a 


jm e? 
32 o7x;_1At; (10:2) 


where: 


Er =m Bee ooh —b (1 m gU) 


When At; — 0, we have e-?^*: zz 1 — aAt; and e-?9^t: z; 1 — 2aAt;. It follows that: 
(e "A5 —e7?955) —  (1—aAt;) — (1— 2aAt;) 


a a 
A At; 


and: 


(sets) TAC AE) 0 -— 23A) 


2a 2a 
z 0 


We deduce that mı (ti—1, ti) 7 Ti-1 (1 a aAti) +abAt;, T" (t1, ti) 2 0?z4 1At; and 
& R$ Q4 — Xi—1 (1 zw aAti) — abAt; 
YY M- Bii —a(b— 2-1) At; 


We conclude that the log-likelihood functions (10.1) and (10.2) converge to the same 
expression when At; — 0. 


. For the geometric Brownian motion, we have E;,_, [e;] = 0 and E;, , [£2 — c? At;] = 0. 
We deduce that: 


{ hia (u, a) = ln Ti ln Ti—1 (u ic?) At; 
hi»(u,c) = (In zi — ln ti—1 — (u = io?) At) — o? At; 


For the Ornstein-Uhlenbeck process, we can use the same two moment conditions and 
the orthogonal condition E;,_, [e;z; 1] = 0. Finally, we obtain: 


hia (8) =p = xj_1e 745 —b (1 — ereae) 
hia (0) = (zi — zie *^* — b (1 — M o (a) 
his (0) = (aj — vj 1e *^* — b (1 — e-*^*5)) xj, 
For the CIR process, we proceed as for the OU process: 
hi1 (0) = zi — mı Ea) . 
hi 2 (0) = (zi — mi (ti-1,5)) — ma (tiit) 
his (0) = (xi — ma (1, ti)) «i1 
where: 
my (tizi ti) = 2;-16 "^^ b (1— e7*^5) 


and: 
(ee E gp 


m» (ti-1, ti) = o? (s- 


a 2a 


Statistical Inference and Model Estimation 197 
5. If we use the Euler-Maruyama scheme: 


X (t) — X (s) ~ a (b — X (3) (t — S) +o |X (S) (W (t) - W (s) 


we obtain: 

his (0) = zx; — mi (ti-1, ti) . 

hi 2 (8) = (xi — mı (4,03) — Mə Qin) 

hi 3 (0) = (zx; — mi (ti—1, ti)) vii 

hia (0) = | (£i — mi CERNE — m» (ti1,)) Tii 
where: 

T" (51, ti) = ği Fa (b = Xi-1) At; 

and: 


m» (ti—1, ti) = e? [ri At; 


10.3.9 "The Tobit model 


1. We note X = X | X > c the truncated random variable. The probability density 
function of X is equal to: 


f(x)- 


where a = o^! (c — u). We have?: 


[X] = cuf nmm 1e 
z mo €x ji) 


1 i oo 
ni 17 («f $ (y) dy + e |-6 (y)]; ) 
= ptoaX(a) (10.3) 


where A (o) is the inverse Mills ratio: 


3We use the change of variable y = o^! (a — p). 
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We have: 


We deduce that: 


where: 
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1 PEE lacu 
LA d 
om zo A o )) i 


T oy)? exo ( j^) dy 


Iw G [ew dy + 2uo [-9 (y)] + 
of y^ó (y) ay) 
zuon la) cs (Iwi [^ 60) av) 


= pg?-42uc(a) 4 
c? (1-- aA (o)) — ui? — 2ucA (a) — o? X (a) 
e) 


We can show that truncation reduces variance because we have 0 < ô (o) < 1. 


2. The censured random variable Y can be written as follows: 


We have: 


We also have: 


I 


II 


~ X if X>c 
BSN if X<c 


Pr{Y = co} E [Ý | X <c] -Pr(Y Ze) E[X | X » d] 


Pr{X « cj ec Pr(X > cE[X] 
9 (o) c t (1— 9 (o)) (u + e^ (a)) 


Pr(Y 2cJE[Y?| X « e] - Pr(Y Ac} E[X?| X> e] 
$ (o)c? + (1— 9 (2) E[X?] 
9 (o) c? + (1 — 9 (o)) (i? + 2ueA (a) +07 (1+ o (0))) 
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We deduce that: 


var (Ÿ) 


7] - [T] 
= $(o)c + (1— 9(a)) (i? + 2ueA (o) + e? (1-- oA (o))) — 
$? (o) c? — 26 (a) (1 — 9 (o)) (uc + ocA (o)) — 
(1 — 9 ())* (u? + 2ue (o) + 07d? (a)) 
= 9(o0)0-9(o)c + O(a) (1—- ae 
a) 6 (a) wo + (¢ (a) (a — ¢ (a)) + id )) 2? — 
a) (1 — ® (a)) uc — 28 (a) ¢ (a) o 
= $ (a) (1 — 8 (a)) (c — u)? — 28 (a) d(a) (c — u) e + 
(1 — ®(a)) (1 +A (a) (a — ¢ (a))) e? 
= $(o)(1-— 9 (o)) o?c? — 28 (a) ¢ (a) ac? + 
(1— (a) (1-6 (a) — A (a) (a) +2 (a)) o? 
= c*(1- 9 (a)) (9 (a) o? — 28 (a) A (a) a 
1 — ó(a) — à (a) 6 (a) + A? (a)) 
= e*(-&(a)) (( —5()) + (e d(a))? 9 (a) 


because we have: 


I 


oc 


—€ (a) X (a) — A (a) ¢ (a) + M (a) = 0 


3. In Figures 10.1 and 10.2, we have reported the corresponding probability density 
function of the truncated A variable X and the censored random variable Y. We 
obtain [X |= = 3.7955, : [X?] = = 18.3864, c (X )= = 1.9952, LY |= = 2.7627, ij = 
11.9632 and c (Y) = 2.0810. We verify that truncation reduces variance: o (X) < 
c (X). In the case of truncation, some observations are excluded, implying that we 
observe only a part of the probability density function. In the case od censoring, the 
probability density function is a mixture of continuous and discrete distributions. In 
particular, we observe a probability mass at the censoring point X — c. 


4. We have: 


I 


Pr{Y =0} Pr {Y* « 0) 


= Pr{x'B+U <0} 


We deduce that the log-likelihood function is equal to: 


Sa-a (i-e (82))- 


n PORA T 2 
Sod (m2r - Ino? 4 (5 Ti 2) ) 
c 


i=l 


NI rR 


where d; is a dummy variable that is equal to 1 if y; > 0. 
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1 2 $ 4 3 6 7 8 9 10 11 12 135 14 


FIGURE 10.1: Frequency (in 96) of the truncated random variable 


FIGURE 10.2: Frequency (in 96) of the censored random variable 
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5. We have: 


and: 


We also have: 


BOYTN. lug 
0c? No] 208 


8£(0) 1 di 1 
"B à (a) et ad wean 4 


T T 
where $; = $ (25). $;— (=£) and nı = X; di. 
c 


6. Since ro (x) = —x$ (x), we have: 
0 ( $ f (x)) ) 9f ()) (6 Cf ()) — f (2) (1 — 9 Cf ()))) f^ a) 
dx \1—®(f (z)) (1 = 9 (f (z))) 


It follows that: 


at (5) TA ae (o deg (=2)) = 


0 Qi EH Qi (zib m 
32 (Gz) ^ 9g3 (1— $,)? (o (1 2) ( m ))ss 


For the Hessian matrix, we obtain: 


and: 


and: 


EE E ee CP))) u 


1 
zi 2o (cm Boi 
dj=1 
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We also have: 


and: 


Ob) _ 1 b (sa so (£ j+ 


2 Ny 
E 2- (yi = x; b) T 354 


7. By multiplying the system of equations (10.4) by 8! / (207), we obtain: 


1 Qi F 1 TA at 
- j — m= T esi 
203 d (; t) a 2g4 m (y Tj 8) x; p 


Combining this result with Equation (10.5) gives: 


l 1 2 Ny 
d;=1 di=1 
We deduce that: 
k ( mE T Tr ^ 
d ox eee) 
1 
`- nm by (yi — 2j B) vi 
d;=1 


Let D; be the Bernouilli random variable such that: 


Pr{D;=1} = Pr{Y* > 0} 


à (22) 
o 


Let Q; be a random variable that is independent from D;. We have: 


J bs d =E pat = pya = > (1 — 8) E[N] 


i=1 ji—1 


and: 
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we obtain: 
0? £ (0) 1 e 09 » 
"Isa58T | = TENE PE = 
[55557] gi a — 9,)? (bi - ( i) zi) viv, 
leal 
i=1 
= — M ani, 
i=1 
where: 


2 [D; (yı — 2; 8)] = E[DUi] 


It follows that: 


di—l t=1 
and: 
0? £ (0) o 2 
E | = E 
where: 
1 2» 1 
bi = =l Qi LÁ | 6:22) 5179 
= uU 2,4, 915 
= 2g3 (osi 1 Qi 1— =) 
We have: 
X (yi B lg = XO (yi zi B) (yi — 2i B) 
di—1 di=1 
= mo? — M (vi-a az. 
d;=1 
and: 


4 6 
20 [o Si = 
lox ME 
= -gad Mita) 05 
i=l i=1 


We deduce that: 


203 


204 


Handbook of Financial Risk Management 


where: 
1 0228-1 ats d 1 
G = a ( 30% 2i ico pizi | + ara oa Piri 
1 3 CHA 
= E ee (eis + Qizi — 1- ð, — 20, 


We conclude that the information matrix is equal to: 


I(0 = EI 


Di uzir bun 
ii biti Xiz Ci 


We retrieve the formula obtained by Amemiya (1973). 


II 


. We note Y, the nı x 1 vector of the explained variable and X, the nı x K matrices 


of explanatory variables when the data are not censured. We also notice that: 


$i 
1-9; 


=À £2 b) 
where À is the inverse Mills ratio. The first-order condition (10.4) becomes: 
—6Xi Ao + XI (vi E xi) =0 


where Ao is (n — ni) x 1 vector of inverse Mills ratio and Xo is the (n — n1) x K 
matrices of explanatory variables when the data are censured. We deduce that: 


Bos (xx) Ree (GY Xlo 
B =ê (XIX) Xj Ao (10.6) 


II 


It follows that the OLS estimator Bi based on non-censured data is biased. 


. We apply results obtained in Question 1 to the random variable U with u = 0, 


c= —z' f and a = o^! (c — p). We have: 


E[Y|Y »0 = E[r'84-U|U » —z! 8] 
= z'84E([U|U » —z' f] 
= g'B+or (52) (10.7) 
oO 
and: 
oY |Y «0| = E[z'8c-U|U x -z! 8] 


= cz'84E[U|U x —z' £] 
MB 0\ (53) (10.8) 


I 
8 


10. 


11. 
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Using Question 2, we obtain: 


TY] = ? [max (c'B+U,0)] 


= x' B+E [max (U, -«' 6)] 


-— o( 78) ap +0(1 »( EET 2 


Eeoae) 
| 


=F) [Y | Y > 0] 


Oo 


From Equation (10.7), we deduce that the corresponding linear model is: 
Yı = Xıő + cAi 
and: 
E n —1 

B = (XiX) XjYi-6(XiX) XjM 

= Ai -e (XTX XIA (10.9) 
The difference between the estimators (10.6) and (10.9) is the term X A which is 
calculated with censored data in the maximum likelihood and non-censored data in the 
last approach. However, the estimators (10.6) and (10.9) can not be used in practice 


because they depend on 6 and on the inverse Mills ratio that is a function of B and 
ô. 


The ML estimates are 90M? = 2.8467, 9(MP) = 1.0843, 8B” = 0.9869 and 6™L) = 


5.5555. The OLS estimates based on the non-censored data are Bios) = 6.2002, 
BLOM) — 0.6757, 5(9 9) = 0.7979. We verify that: 


(ML) (ML) 
1 2 


BOES) _ MD) (XTX) XXL AM = | 10843. | = BOM) 


and: 


(OLS) _ MD) (XTX) XTAOD = | 1.0065- | 4 901 


The ML estimator combines the non-censored data — B (OLS) _ and the censored data 
z XIAMD This is not the case of the second estimator, which is only based on 
non-censored data — (919) and X] A(MD. The second estimator is then less efficient 
than the ML estimator since it does not use all the information provided by the data. 


The conditional predicted value of dj; is: 


=o. x a} BML) t 
T, BML) — G(ML)A EE ify; € 0 
R= HA 
a? BML) 4 MD) p ig >ü 
a G6 (ML) Yi 


whereas the unconditional expectation is j* = x} B (ML). These values are reported in 
'Table 10.1. 
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TABLE 10.1: Predicted values i; and $7 


i 1 2 3 4 5 6 T 8 9 10 
Vi 4.0 0.0 0.5 0.0 0.0 17.4 18.0 0.0 0.0 9.7 
Ü; 3.5 —127 51 -59 -5.9 143 17.6 —42 -54 4.0 
j; —3.0 —12.6 16 -34 —3.4 14.2 17.6 0.8 —2.4 —1.4 
i 11 12 13 14 15 16 17 18 19 20 
Yi 9.7 1.8 6.5 26.1 0.0 5.0 21.6 6.2 9.9 1.4 
yz 9.3 5.1 6.2 16.3 —15.1 3.6 19.6 5.9 9.3 6.4 
0 8.6 1.6 3.9 16.3 —15.1 -25 19.6 3.4 8.6 4.3 
i 21 22 23 24 25 26 27 28 29 30 
Vi 5.0 0.0 0.0 18.1 0.0 T. 00 0.0 0.0 4.0 
UF 4.1 —5.8 -94 15.7 -3.3 17.3 -6.2 —54 —34 12.2 
0; 0.8 3.2 8.8 15.7 38 17.3 —3.9 —2.3 3.2 12.0 


10.3.10 Derivation of Kalman filter equations 


1. We have: 


We introduce the notation à; = o, — ó;.. It follows that: 


and: 


ôr 


rô 


We deduce that: 


II 


Oti—-i = 


2—1 [o] 


= Tiori + ce + Rem — (TrGe-1e-1 +c) 


TAE à [a1] + e 


Tiós4 A1 + Ct 


= Ti (oua — 64 31-1) t Rite 


= Ti (at-1 — óiaya) (oii — 6a 1) Ty + 


53 [Tiat + ct + Rint] 


2T, (oua x O43) n RE + 
Rimi Ri 


Pa 


2. We have: 


;L—1 | (os = Âte—1) (a Z qi) | 


TP; iai 25 RQR 


Ut 


Ye — Eia [ye] 

Ye — Er—1 [Zio + di + ei] 
Ut — Ztâtlt-1 — d; 

Zi (at — &su1) + & 


TE, 4 [c 1 — és apii) (083 — âti 1) | T, + 
2TJE, 1 (04-1 — Giai) m] RE + 
RE a [mm ] Ri 
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Since à; is a Gaussian vector, v; is also Gaussian with: 


uiu] = Ecaálw- Zqea — de] 
= 4E [os — Aye 1] + Eia [e] 
= 0 
and: 
" 
FQ = E 1 [Cv = 0) (ve — 0) | 
= Epi (2 (o4 — âtjt-1) + €) (Ze (ar — ài) + e) | 
= ZE; [la = âtt-1) (o4 = G1) | Zl +E fered | 
= Z:Py41Z) +H 
3. We have: 
t-1 farv | = dE les (Zi (os — à&iu1) + «) | 
= E les (af — Sips) Z +E; [are | 
= Lt—1 [C = ósu-1) (af ud aj.) Al + 
unm 174—1 e m M Z 
= Pas. 
and: 


Qt oe Qr 

Ut 7 Z (o4 — Gtje-1) + & 

JT 0 Ot 3: 0 

Zt In €t — Za Üu-i 


Conditionally to the filtration F;_1, the random vector (az, vz) is a linear combination 
AX, + B, of the independent Gaussian random vector X, = (o4, e€;). We deduce that: 


Ot ~N Ôtlt—1 Pia PqcaZ 
Ut 0 NZ Papa F; 


4. We deduce that: 


Il 


II 
> 
a CUN 
eg 
cd 
+ 
cJ 


Atle = E; [o] 


= c [a | v — 9 — ZtÔtlt—1 — di] 


Using the standard results of the conditional distribution, we obtain: 
Ĝtjt = Ott + Baciz H^ (ye — Ztôrt-1 — di) 


and: 
Pj = Bei — Pea Zi Fy Za 
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5. The Kalman filter corresponds to the following recursive equations: 


Aqui = TiÓ s Lii + 6 

Pia = TePe- T; + RQUI 

Ve = Ye — Zi Qtjt—1 — d, 

F; = ZP- Zi + Ai 

Ate = Oui + Pa Z] Fy iu 

Pas = Pra — Papi Z6 Fy Ze Pie 


We have: 


trilt = Teri Gee + Cua 
^ T p-1 
= Tha (Gea + Pye-1 Ze Fy vi) + Cea 
Tixióquoi + Cet + Kiwi 


II 


where K; is the gain matrix: 
ky = Ti Pie, Fo 
Since we have v, = yt — ZiÓqu 1 — d+, we can write the state space model as follows: 


Ye = Ziéquoi + de + vi 
tyit = Traduit Ceo + Keve 


If v, = 0, then Qy4ije = Trsióquo1 + ceci. Ky indicates how the filter changes the 
classical estimation Ty11@4j¢-1 + Ct+1 when it takes into account innovation errors. 
Therefore, K; is the correction matrix of the prediction-correction method. 


6. We introduce the process y = ^4-1 with yo = 1. Another representation of the state 
space model is: 
Yt = Zar + deve € 
at = Tiati + eye + Rint 


Vt = Vt-1 
We obtain: 
yt = Zioa% 
af = Trot, Rini 
where: 


Z=(Z% dh h) 


T; Ct 0 

T =| 0 1 0 

0 0 0 
R, 0 
R-|[0 0 
0 lh 


The state vector becomes af = (a4, y+, €;) whereas the noise process nz = (me, €+) is a 
Gaussian random vector N (0, Q7) where: 


e-(& H) 
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7. If we apply the Kalman filter to the augmented state space model, we obtain: 


Ore 1 = Třâ 1|t-i 
Pit 1 5 Phy T + RQ PRT" 
E 15% te 1 
Uf = Yt— Tie 1 
Pl = BP i Zt 
One = m T Ph Lu FE 
Pu = Pica m Paz B "Ze 1 
We have á$ = (âo, 0, 0) and: 
P 0 0 
=| 0 0 0 
0 0 O 


We assume that P; 


"t has the following structure: 


Pe 0 V 
P-| 0 0 0 
V 0 W; 


We deduce that V; = RO and W; = Hi. Finally, we obtain: 


Ĝtjt-1 = Teât-1jt-1 + 6 

Pyar = TP 1T, + RQR 
Ütjt—1 m Zi qui td 

Vt = yt — Geje-1_ 
F, = Z:Py1Z) +2Z,R,C) + H 
Gi = Papi Ze on 

Att = = Gi 1 + GE; 


Pae = Pg — Ge; Gq 


10.3.11 Steady state of time-invariant state space model 


1. We have ay = (yr, &) and: 


(t 2) G4)» 
R 


Using the standard SSM notations, we have c = (p, 0), 
Sp m dS 
du ( 0 0 


It follows that: 


and: 


P es 
D 
] 
3 

n 
Il 
LA 
EET 
© 
= 
E 
O =. 
= 
lo 
MSS 
— 
way 


Il 
ZU 
T 
oce" 
=. c 
Ny 
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The steady state âs is then equal to: 


Ago = (15 -T) 5s 


We also have: 


xL c5 dod 
RQR sali 1 
and: 
o 00 0 
0 00 0 
TeT=| ò 000 
0000 
We obtain: 
zs 0 0 0 
1 
E 0 100 
h-TeT)'- 
gae) 0 010 
0 001 
and: 
vec (Pœ) = (I-TG&T) vec(RQR') 
1 c? 
icd e 
= Oe 
g 
c2 


We finally deduce that: 


. We have o4 = (yr, €4) and: 


0 —0 1 
sea Gene 
R 


Using the standard SSM notations, we have c = (p, 0), 


EGET 
Po) 


We obtain: 


and: 
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We have: 
100 -2 
010 0 
lL-T8T= 0 0 1 0 
000 1 
and: 
1 0 0 & c2 o2 (1+ 67) 
010 0 o2 o2 
vec(Po)=| 9 9 1 0 ge Ni g? 
E E 
0 0 0 1 c? c2 


Finally, we obtain: 


. We have o4 = (yr, €4) and: 


epu aret to ets 


Using the standard SSM notations, we have c = (1,0), R = (1,1), Q = o2 and: 


$1 —80 
"(t ) 


We obtain: 


and: 


We also have: 
$? —b10, -óà 9 


0 0 0 0 

Se pho 0 0 

0 0 0 0 

It follows that: 

$101 $101 01 o2 
1-97 1-92 1-42 1-92 5 
vec (P) = 0 1 0 0 Ce 
0 0 1 0 Ce 
0 0 0 1 Oz 


and: 
1—2¢101 +0? 1 
P = c2 1—ó1 
1 1 
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Using the standard SSM notations, we have c = (1,0), R = (1,1), Q = o2 and: 


We obtain: 
ü = 1 1-6) 01 H = H 
? 1-6, 0 1 0 0 
and: 
10 0 S 1 
I c2 1-07 
D. d 0 o? Ce 
vec (Pœ) = m 3 |= 31 |o? 
0 0 1 i5 ce 1-8 
E c2 1 
00 0 1-8? 


We deduce that: 


10.3.12 Kalman information filter versus Kalman covariance filter 


In what follows, X-! defines the inverse of the square matrix X and Y~! defines the 
Moore-Penrose pseudo-inverse of the non-square matrix Y. 


1. We have: 


(Im + AB'C?B) ^A = (L,-AB'C?B) (A 3)^ 


(AGRA ABC B) 
(A^ 4BTC-B)* 


2. If the relationship (Im + AB'C-1B) ' = Im — ABT (C + BAB") — B is true, we 
must verify that: 


() = m ABTC-!B) (Im - ABT (C+ BAB") ' B) 
zs qp 
We have: 
() = (Im+ABTC™1B) (Im -ABT (C+ BABT) ' B) 
= (L4 AB'C-!B) — (Im + AB'C-! B) AB" (C+ BAB") ` B 
= (IL, AB'C^B)- (Im + AB'C-! B) ABT (BC + AB") 
= (Ll. + AB'C-!B) - (Im + AB'C-!B) (B^ CBT AT! + Im) a 
Since we have: 
Im +ABTC1B = (AB'C^B)((AB'C-!B) > + Im) 


= (AB'C^B)(B-CB'" AT! + Im) 
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We deduce that: 
() = (Im +AB'C™'B) - 
(ABT C7! B) (B! CB" AT! + Im) (B-1CBT A! + Im) as 


= (Im+AB'C™'B) - (AB'C^! B) 
=> I 


3. Using Questions 1 and 2, we have: 


(Im + AB'C? B) A = (A7! + BTOB) + 


and: 
(Iot AB OTB) =ke ABT (OFBAB T B 
We deduce that: 


(471+ BTOB) = (L.-AB'(C- BAB')  BJA 


II 


A- ABT (C-- BAB') BA 
and: 
() = (L,-AB'C-B) AB'C^! 

= (apt B O B] BO 

= (A- ABT (C+ BAB") ' BA) BTC 
= AB'C^ - AB' (C+ BAB") ` BAB' C^! 
= AB'C^- AB (C-- BAB')  ((BABT 4 C) C^! - I) 
= BO -— AB" (C+ BAB") (BABI +C) 05 + 

ABT (C + BAB)” 

= AB'C^ - AB'C-! + ABT (C+ BABT) ` 


Finally, we obtain the expected result: 
(Im + ABT C? B)! AB'C? = ABT (C+ BAB™)~ 
4. We have: 


(=) = 
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5. Let V be a covariance matrix. The information matrix I is the inverse of the covariance 


matrix V: 


I=Vv-! 


This matrix is used in the method of maximum likelihood. 


6. The state aj), (resp. O41) is the estimator of a; normalized by the covariance matrix 
given the filtration JF; (resp. F4). 


7. We have: 


lPi- = 


because: 


—1 
Pa Pia 


i qu -1 
BS (Im — Pqii Za Fi A) Pia 
Pata Um + Pai Ze HE Zi) Pea 


Im + Z HEP: 


(ps Zim. 


T -1 
= Lh-PqgaZ (-EtAPgaZz) Z 


= T 


— Py aZ] (-H)  Z 


= Im + Pg aZ] Hi Z 


We also have: 


pru 


-1 
ly Pr aZe (Pi + Hi) 


s se BAZ Paes) Ol Bie eH 
= EIU ABA eH 


By using Question 4 with A — 


Tee Peje-1 Ze (Zi Pii Zi T Hi) 


8. We have: 


Zi Pg aZ and D = Hi, we obtain: 


1 = Zy Hy 


-1 
yi Pra 

=i 

= (TPT, + RQR} ) 
ES 
E (nii ad + RQR ) 
(b) 
Oda - Teje—1 Geje—1 


II 


Lys a TrÓs aia 


—1 ^X 
Iii, uy eae 
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Leje = p.i 


t|t 


= Py} 


T 5-1 —1 
t|t-1 (Im m Pu F; Zi) 
— Py} 


ea (Im + Peje-1Z)' Hy Ze) 
= Iy t Z Hy Z 


Qtt = Irán 
= Tne (qa Baz] B, (ye — Ztât-1)) 
= Tye(m—- Baz HF UA Ĝtjt—1 + 
In Paci Zi Fy ye 
= yit Z Hw 


We deduce that the recursive equations of the Kalman information filter are: 


1 
L-1 = (rar ee ud + I ) 


= Tyee UL 


Oi apti ipea 
"enis i+ Z Hy E 
Orie (7 RC LA H; E 


From a numerical point of views, the number of matrix operations is: 


e 5 additions, 10 multiplications and 1 inverse for the covariance filter; 

e 3 additions, 10 multiplications and 2 inverses for the information filter; 
It is not obvious that the computational time is reduced when using the information 
filter. Its advantage may be due to the inverse of I,_1);; that can be more stable 
than F; in some cases. 
. We have: 

nT 1 1 E 

£(80) = -—3 hm) — 7 al — 22 pA E 


In the case of the Kalman information matrix, we have: 


Zl a + H 
Vt = Ye — Zele Âte 
as =0 


Ip =0 


In the case of the Kalman covariance matrix, we set o ~ N (0, xIm) where & is a 
scalar sufficiently high such that Ip = &^11I,, c 0. 
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10.3.13 Granger representation theorem 
1. We have: 
Yt = Mt + $' yi + Et 
We deduce that: 


yt — Ye-1 = Hi + 9 yii — yia d Et 
and: 
Ayi = qu + (95 — In) i-i + Er 
2. We have: 
Ur = ht 95 yii + Phyt- + Et 
We deduce that: 


Ay = wet (91 — In) i-i + 95yi-2 9 Et 
= pet (0, +83- In) w_1 — OAM a + & 


3. The relationship is true for p = 1 and p = 2. We note: 


and 


We notice that: 


Il 
Im 
i Y 
=, HG 
Em 
| 
i 
MAE 
+ 
e 
w^ 


= ge) + S, 


and: 


We prove the relationship by induction. Let us assume that it holds for the order p—1. 
We have: 


Ay = Y 
p 
— quc Pu PiYt—i + Et — ya 
p-1 
= bet ey, S iyi Et — yii + 9,yi-p 
_ pP-l |(p—1 
= pm pag + ` QU DA, ied Pp Yt-p 
p—1 
= mt (ut? -= $;) Ye-1 + 2g e (oP + 2) Ayt—i + 
Et + Pp Yt-p 
p 
= quo II)y, , + ae DP Ayi F Et + h 
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Since ol) — ) = 0, the value of m is equal to: 


p—1 

m = —®,y-14 pS eL Ay; — OP Ay s + 9 yi, 
i=1 
p-1 


= —O,y%-1+ So oF, (Yt—i — Yt-i—1) + Dey 
i=1 


pY- + 9,yiai — 95yic2 + Vica + 
— $5yi-p + 95yi-p 
= 0 


Il 


It follows that the statement also holds for the order p. 


10.3.14 Probability distribution of the periodogram 


1. Since a(A;) and b(A;) are the sums of Gaussian random variables, they are also 
Gaussian. We have: 


D[a(A;) = eet 


z mE [ye] cos (Azt) 


and E [b (4;)] = 0. For the variance, we have: 


var(a(A;) = E (Fa Sonoma) 


1 E 
2d [yz] cos? (At) + 
t=1 
1 
- i [ysyt] cos (^55) cos (A;t) 
szt 
S xS 


_ - 3 (=e + +) 


If Aj Æ 0, we obtain: 


; c? g? 2 s (2d; t) 
Jim var (a(A;)) = gett ore E eee 
52 
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We also have: 
jou : 
var(b(A;)) = E||—-2» wsin(A;t) 
Jj yn 2. t j 
2 TL 
o JA 
= — M sin (Azt) 
iem 
FEES x: 1 — cos (2A;t) 
© n = 2 
f Aj Æ 0, we obtain: 
g? 
lim var(b(A;)) = F 
2 2 
We deduce that (Aj) ~ M (0, 5) and b(A;) ~ N (0, 7). 
2. We have: 


n— oo 


n—oo 


lim cov(a(A;),b(A;)) = lim | ES pom Qi Yrs) 


2 n 
: o x 
Jim E 2. cos (A;t) sin (A;t) 


= 0 


It follows that a (Aj) and 6(A;) are asymptotically independent. We conclude that: 


and: 


3. We verify that: 


and: 


Since we have: 


we deduce that: 


F(a? (Aj) +07 09) «x 


T (Aj) ~ X2 
Lu, 05 = AO? x p] = Fy (a) 
MO a RENI 


4 


Pr {0.0506 < x3 < 7.3778} = 95% 


IL . 
Pr {0.0500 «31i 02 < ams) = 95% 
y Jj 


Finally, we obtain: 


Pr (0.27 Iy (Aj) € fy (A) € 39.5 - I, (4)) = 95% 
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4. If A; = 0, we obtain: 


o? f1tl 
li 0)) = — pq 
a NL 2 E 


and: 


It follows that: 


and: 


—1 


For a white noise process, we have f,(0) = (27) o°. Therefore, we can make the 


hypothesis that: 
Ty(0)  » 


1m N 
nso fy (0) 2 


We notice that: 


lim E [Iy (0)] = fy (0) -E [x7] = fy (0) 


and: 
Jim var (Z (0) = f; (0) - var (x1) = 2/f; (0) 


10.3.15 Spectral density function of structural time series models 


1. We use the canonical representation of state space models. For Model (M1), we have 
Zi = 1, Ort = Ht, d; = 0, Hi, = o2, Ti 1, Ct 0, Ry 1 and Qt = o2. For model 


(M2), we obtain Z; = ( 1 »ha-(R)e-omo-en-(i D 


Bt 
a= (o J o rmaQ - 7a 2 
2. For Model (M1), we have: 
(1—L)y = (mit e) — Quoi + 4-1) 
= (Me — pai) + (Et — &-1) 
= mt(1—Lje 


Since the sum of two stationary processes is stationary, it follows that m + (1 — L) e 
is stationary. We deduce that the stationary form is S (yj) = (1 — L) y. The spectral 
density function is equal to: 


fsa QA) = (27)! (o? +/1— ga e) 
= (9s) (o + [1 — (cos (—A) + isin (—))|? e) 
= (20)! (o? + |1 — (cos À — isin A)|? s) 
= (2m)! (o? + |(1 — cos A) + isin A|? e) 
= (2m)! (0? (a cos A)? + sin? X) e?) 
= (2m) (o2 (1— 2cos A + cos? A + sin? A) o2) 
= (2m) (o242(1- cos A) o2) 
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For Model (M2), we have: 


(1— L)y = (m8) — (uc 63) 
= (fe — pii) + (Et — €) 
Bt—1 ne + (Et — £11) 


I 


(1 — L) y is not stationary because the process fj, is integrated of order 1. We have: 


Q-L5'yw = (1-2) (Gratme + (Gr 7 6) 
= (6-1 — i2) + Qi m) + (1 Lg 
= Gactü-Dmcrü-L?a 


Since this is the sum of three independent stationary processes, the stationary form 
of y, is equal to S (yj) = (1 — L)? y. We have*: 


Jae] = p-e?fa-ef 
= (2(1—cosA))” 
= 4(1— cosà)? 


We conclude that: 


E 02 + 2(1— cos A) o2 + A(1 — cos A)? o2 
2m 


In Figure 10.3, we have represented the spectral density functions of Models (M1) and 
(M2) when e; = oy = o¢ = 1. We observe that they are similar for low frequencies, and 
the difference between the two processes comes from the dynamics on high frequencies. 


. Ht is the stochastic trend, 8, is an AR(1) process that can be viewed as a mean- 


reverting component when ¢ < 0 and ^; is a stochastic seasonal process. When o, = 0, 
we have: 


Yt—-s+1 F- + W-1 HW =0 


^ Another way to find this result is to notice that (1 — Ly: = 1 — 2L + L2. Therefore, we have: 


| — 2e79^ 4 (e7)? ° 


Caer) | 


2 


|: — 7D 4 e-2iÀ 


= |(1—2cos\ 4- cos 2A) + i (2sin A — sin 2d)|? 

=  (1— 2cosA + cos 23)? + (2sinA — sin 2A)? 
1 — 4cos À + 4cos? A + 2cos2À — 4 cos À cos 2 + cos? 2A + 
Asin? A — 4sin Asin 2A + sin? 2A 

= 6 —4cosA + 2cos2AÀ — 4 (cos A cos 2A + sin Asin 2A) 

= 6 —A4cosA + 2cos 2d — 4cos (A — 2A) 

= 6—8cosAÀ--2cos2A 
4 — 8cos À + 2 (1 + cos2A) 

= 4-—8cosA-c2 (1 + cos? À — sin? à) 


= 4-—8cos AÀ + 4cos? à 
= A(1- cos À)? 
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55, 
ae 
7 
3.0 F VA 
7 
2.54 / 
Model (M1) / 
" — — Model (M2) / 


n/2 3n/4 T 
À 


0.0 


0 m/4 
FIGURE 10.3: Spectral density function of Models (M1) and (M2) 


Since we have yw, +... + ^ft-2 + ^1 = 0, we deduce that: 


= (gar yee ee) 
"Yt—s 


Vt 


We obtain a deterministic seasonal time series, where s represents the period length 
of a season. For example, if s = 4, we obtain: 


Yt = Vt-4 = Vt-8 — 

Vt+1 = Vt-3 = Vt-7 5... 
Vt+2 = Vt-2 = Vt-6 5... 
Ve+3 = Vt-1 = Vt-5 =--- 


The process repeats every four time periods. If oy Z 0, we have y541 +... + yii + 
^ = we and Yt—s +... + ^yg-2 + W-1 = 0-4. Therefore, we have: 


Yt 


We deduce that: 


E. sy] 


wi — (Ye-s41 +--- + 02a) 
We = (w1 m Fizy) 


Ye-s + (wt — w1) 


Ets [Mis + (wi — 006 3)] 
Yt—s 


It follows that ^ is a stochastic seasonal process. 
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5. We have: 


I 


Rt (1 = L) (1 — L?) Ut 
= (b) ee) ae) Be D) UE E yg] 


(1— D) (1 — D^) 


and: 


(l—D(l—L)y« = (Je — t-11) — (Yt-s — Yt-s—1) 
= (v — Yi-s)— (i-i — Yi-s—1) 
= (wi — w1) (wii — w2) 
= We 2w +w 
= (1-L) w 


We deduce that: 


a4 = ü-LD)m-crüuü-L)u-L)8- 
(Ec L) (1 = L°) e, + (0 oy 
If we assume that |ó| < 1, then f£, is stationary. Moreover, we know that m, c; and 


w are stationary. We conclude that z+ is stationary and S (yj) = (1 — L) (1 — D?) y: 
is a stationary form of yz. 


. Another stationary form of y; is (1 — L*) y+. Indeed, we have: 


(1 = L?) Yt = (1 L?) Ht t (1 L*) Pt t (1 L^) €t d (1 L) Wt 
and: 


(1—L*) pe = be h-s 
= (m-icm)- ns 
= m+ (pi-2 1-1) — Ht-s 
= M+ M-1+ +--+ M-s-1 


Ex 


We deduce that (1 — L°) u: and (1 — L*) y; are stationary. 


. We note gy (e(L)) = |e (e ^P. We have: 


g^ (1 — L5) = | = ecu 2 


E |1 P gisaj? 


= (1—coss\)* + sin? sd 
= 2(1-coss) 


and: 


gal- L-E) = g.0-L2):gA(1— P^) 
4 (1 — cos A) (1 — cos sA) 
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We remind that 


1 


g^ (a = eLy) = WE de 9p 
1 


(1 — $cos A)? + ¢? sin? A 
1 
1 — 2¢ġ cos À + ¢? 


We deduce that: 
1— L)(1—L? 
fs) = al- to (EAA) aa 
ga (1-20 - L*)) 02 +9 (0 - D?) e 
= 2 (1 — cos sà) o7 + 


4(1— cos A) (1 —cossA)Y , 
( t 


1 —2¢cosr+ ¢? nex 
4 (1 — cos à) (1 — cos sÀ) o2 
(4 — 8cos À + 4cos? A) o2, 


We have seen that: 
g(1—-2)(1—2)) = g(1-L-LI xL) 


4 —4cos À — 4 cos sÀ + 
2cos (s — 1) A + 2cos (s — 1) à 


By using the properties of trigonometric functions, we obtain: 
ga ((1— D)(1— L?)) = 4-—4cosr— 4cossA + 
2 (cos sà cos A — sin sà sin A) + 
2 (cos sÀ cos À + sin sA sin A) 


= 4-— 4cosA — 4cos sÀ + 4cos sÀ cos A 
= 4(1-— cos A) (1 — cos sà) 


'The spectral density function is then defined as follows: 


fsa (A) = c |(1— cos sà) o? + 
E 2—2cos A + Xj: (3 |j] — 2) cos (s + j) A ^ 
T 


1 — 2¢ġ cos À + ¢? 
277! (1 — cos À) (1 — cos sÀ) o2 
277! (1 — 2cos À + cos? À) o2 


10.3.16 Spectral density function of some processes 


We note gy (y(L)) = ly (e=) E 
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1. We have: 


g (1-2) = |.- cT 


T |1 = gm 


(1 — cos sà)? + sin? sà 
2 (1— cos si) 


Il 


Since we have (1 — L^) y; = €i, we deduce that: 


o2 


AO) = aa) 
2 


Te 


4r (1 — cos (sA)) 


2. We have: 


N 
E 


2 
(1— M 


Qq 
[UN VI 


= |1 = genre 


QN 2] 


(a — cos A)? + sin? X) x 


ony 


(2 (1 — cos 4)) 7 


on 3 


Il 
on 2 
LS NL ON 
N 
a: ; 
B 
| 
LA 
M 
& 


HR. SIS Slk I X] 


N 
E 


because?: 


duis — TCE >) cos (3 +3)) 
2 2 2 2 2 2 
1 
3 


3. We have: 
z —(1—9L) u + (1—0L) v 


We deduce that: 


ee o2 (1 — 20 cos A + 6?) o2 
f. )= zx — 26cos A4 9) ' Qn 


5We use the following trigonometric identity: 


sinasin 8 = ; (cos (a — 8) — cos (a + B)) 
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(a) The simulated time series z is represented in the first panel Figure 10.4. In the 
second panel, we also give the periodogram of z: 


Log = QP L1 Seal 
j= = Ze 
= 27n 27n = á 
where A; = 27 (j — 1) /n and j € {1,..., n}. 
Simulation of z, 
CO 
0 200 400 600 800 1000 
Periodogram |,(A) Spectral density function f,(A) 

6 6 

5 5 — True 

4 4 === Estimated 

3 3 


n/4 n/2 3n/4 T n/a n/2 3n/4 n 
À A 


FIGURE 10.4: The AR(1) + MA(1) stochastic process 


(b) The Whittle log-likelihood is equal to: 


Le 1 L (Aj) 

£(6,04,0,0,) = -nIn2n- 5 5 m fe (A) - 52, 

j=1 j=l 

where A; = 27j/net j € (0,1,...,n — 1). With the simulation in Figure 10.4, we 

obtain the following estimates: 9 = 0.755, 6, = 0.896, Ê = 0.120 and 6, = 0.595. 

'The true and estimation spectral density functions are given in the third panel 
in 10.4. 


Chapter 11 


Copulas and Dependence 


11.4.1 Gumbel logistic copula 
1. We recall that the expression of the Gumbel logistic copula is: 


U1 U2 


Uy + U2 — uiu» 


C (u1, ua) = 


We have: 


U2 (ui + uz — uuz) — u1u» (1 = ua) 


O1 C (u1, u2) = 5) 
(u1 + u2 — urua) 


2 
uz 


(u1 + u2 — wuz)” 
We deduce that the copula density is: 


c(u,u2) = 8C (u, u2) 
Que (u1 + u2 — uuz)’ — 2u2 (uy + u2 — u1u3) (1 — u1) 


(ui + u2 — uua)^ 


2u U2 


(u1 + U2 — uua)? 


2. We have: 


1—2u-4 C (u,u) 
l—u 
(1 —2u)(2—u)+u 
(iu) — vj 
2u? — 4u + 2 
u? — 3u + 2 


At (u) - 


Using L’Hospital’s rule, it follows that: 


Qu? — 4u 4-2 

+ M . 

à Pt nm u2? — 3u + 2 
4u — 4 

im 

u>1 2u— 3 


= 0 


The Gumbel logistic copula has then no upper tail dependence. For the lower tail 
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dependence, we obtain: 


= il ems 
v0 2— 2u 
1 


2 


We verify that it has a lower tail dependence. 


11.4.2 Farlie-Gumbel-Morgenstern copula 
1. We have: 


C(u,0) = C(0,u) =0 
C(ul) = C(Lu)-u 
0? C (u1, u2) 


14-0(1—2 1-2 
Bu DIS +0( u1) ( u2) 


As we have —1 € 1 + 0 (1 — 2u1) (1 — 2u2) < 1, it follows that: 


3C (u1, ua) > 
ð u10 u2 


We deduce that C is a copula function. 


2. We have: 
T 1 — 2u + C (u, u) 
u1l- l—u 
1—2u 4 u2 (ioa -u) 
—  ]im 
u1- l—u 
= lim (l—u)(1+6u? 
Ee) 
= 0 
For the Kendall’s tau, we obtain: 
T= 
The Spearman’s rho is equal to: 
Q — 


3. We calculate the conditional copula: 
We simulate (U1, U2) in the following way: 
By applying the PIT method, we obtain: 


zy = pto' (uy) 


X2 = on (1 — u2) 


Copulas and Dependence 
4. We have: 


0? C (u1, U2) 
OQ u10 uz 


c(u1,U2) = 


It follows that: 


f(@1,%2) =  c(Fi(z1), F2 (22)) x fo (a1) x fı (21) 


- eem mcr) n). 
(==) x enh 


Lo= ninà- 2mo? — = In2n 4 


ym (1 Fe (1 2 Ge 3) (Qe? — )) = 


1 7 Tii — 2 7 
D A ) a as 


i=l 


We deduce that : 


11.4.8 Survival copula 
1. We have S (0,0) = 1 and S (co, 00) = 0. We notice that: 


0158 (21, £2) 


I 


< 0 
We conclude that S is a survival function. 
2. We have: 
Sı (xı) = S(a,0) 
= exp(-m) 


By noting U; = Sı (X1), we deduce the expression of the survival copula: 


mul 
Cursus). = exp (- (mu = mus coge) 


In uy + In ug 
Uy U2 
= uru? exp (0 ) 


Uy + Ug 


with à = —1nu. 
11.4.4 Method of moments 


C (xi, X3) —0C + (1 — 0) ct 
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1. We have: 


F (x1, £2) = 0 x max (® (zi) + ® (x2) — 1,0) + (1 — 0) x min (6 (z1) , ® (x2)) 


It follows that: 


HXQXQ = ECT IS 
> ax ff sissac- NE (i) + 


(1— 0) x Ji 2123 dC ^ (® (x1), ® (x1)) 


= 0 x(—1)+ (1—0) x (+1) 
= 1-20 


We deduce that: 


p (X3, X2) =E [X1 X3] —1]-20 
The linear correlation between X; and X» is equal to zero when 9 takes the value 1/2. 


2. Using the notations N; ~ V (0,1) and N2 ~ N (0, 1), we obtain: 


p(Xi, X2) = p(m +o1N1, u2 032) 
p (Ni, Na) 
1 — 20 


Il 


3. We have: 


g- ice X0 
E 2 


The MM estimator Îmm is then equal to: 


» 1-6 
bum = = 


where 9 is the empirical correlation between X; and X». 
11.4.5 Correlated loss given default rates 


1. As we have x € [0,1], the parameter 7 must be positive or equal to zero in order to 
have F (0) = 0, F (1) = 1 and f (x) = ya?-! > 0. 


2. The expression of the log-likelihood function is: 
£9) = > mf (z) 
i=1 
= 5 In (2277) 
i=1 


= nlDny-c(y— 1) S Ina; 
i=1 
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We deduce the first-order condition: 


dL(y) n QN 
— —— =08-+ Ina; =0 
We finally obtain the ML estimator: 


" 1 
IM CSS SR) 


. We have: 
1 
[LGD] = J ryx! da 
0 
1 
Sey | x’ dx 
0 
Fa, 
=T 
y+1 0 
zx a 
y+1 
Let z be the empirical mean of the sample {21,...,2,}. The MM estimator 4m 
satisfies the following equation: 
ÂÎMM x 
~~ =& 
4m + 1 
We deduce that: 
x T 
YMM toe 


I 


. In the case z; = 5096, we obtain: 
AML = ——— =]n2=1.44 
n 


and: 
4um = 0.5/ (1 — 0.5) = 1.00 


The numerical results are different. For example, we have reported the density function 
of the two probability distributions in Figure 11.1. 


. We have: əc ) 
u1, U2 = = 
, = use 0lujlnnuo _ buz In uae 8 ln u1 In ua 
Qui 
and: 
3C u1, U2 
( D ) =e 9hulnus. gin ye Fn nue _ Aln uze? mt muz 
OQ u10 uz 


Qe 91» ui ln u2 TE 0? ln ui ln use? u1 ln u2 


= (1 —0—01n (u1u2) + 9? In a4 In uz) e 9i In u2 
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FIGURE 11.1: Density functions associated to ML and MM estimators 


6. The bivariate density function is the equal to: 


flay) = e(F (a), F(y)) x f(x) x f (y) 


(1-0-0(yna+72ny) + 0?s151n zln y) x 
e ?no2lnzlny y yır! x yoy?! 


where ^, and 72 are the parameters associated to the risk classes Cı and C2. It follows 
that the log-likelihood function is equal to: 


L = nhn^o nlnyo 4 (ym — 1) nz; 4 (9o — 1) Sony, + 
i=1 


i=l 
n 


y» (1 — 0 — 0(y4In zi + y2ln yi) + 67172 In z; In y;) — 
i=1 


07172 5 ln z; In y; 


i=1 
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7. The first-order conditions are: 


Ok n = 

— = —+ In z; + ya, 0 

Ja a 2. 91 (V1; 72,9) 

Ok n i 

MESI s — NEZ In à , ,0 

032 72 2. Vi + 92 (V1, V20) 

OL — S 1+ (iln z; + y2 ln yi) — 269172 In z; In y; 

00 Ex 1—0 — 0 (y1 ln zi + yoln yi) + Pyne; In yi 

^ni ^yo y» zilnyi 
i=1 
with: 
( TEs 5 (0245 In y; — 0) In zi 
PEU S B rum 1—6—60(^41nzj + y2 ln yi) + 6?31*5 ln z; ln yi 
0^5 Soin x, ny 
i=1 
2 (0254 Ina; — 0) In yi 
0 = 
92 (11 92:0) 3 1—6—60(^41nzj 4 y2 ln yi) + 6?q1*391n z; In y; 
0^ y» x, ny 
i=1 


When Ó is equal to zero, we have gi (71, 72,0) = 0. In this case, the estimator 41 
corresponds to the ML estimator 4r. When we have 6 Z 0, we obtain gı (71, 72,9) Æ 
0 and 41 4 4x. We obtain this result because more information is available in 
the bivariate case. The ML method can then correct the estimator ^wr by taking 
into account the dependence function between LGD, and LGD». For instance, if the 
estimated copula is equal to the Fréchet upper copula Ct, it is obvious that the two 
estimators 5; and 42 are equal, even if the unidimensional ML estimators are not 
necessarily equal. Let us consider the following sample: 


LGD, (in%) 50 40 60 50 80 90 70 10 40 40 
LGDs (in%) 60 50 80 70 80 90 80 30 50 70 


We obtain ^wr, = 1.31 for C4 and IML = 2.18 for Cə. With the bivariate ML method, 
we obtain 41 = 0.88, 4» = 1.44 and 0 = 1.71. 


11.4.6 Calculation of correlation bounds 


1. We have: 
C (ujug) = max(u;-us-— 1,0) 
C-(ui,u9) = wu» 
CT (ui,uz) = min(ui, ua) 


Let X4 and Xə be two random variables. We have: 
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(i) C (X41, X2) = C7 if and only if there exists a non-increasing function f such that 
we have Xs = f (X1); 
(ii) C (X, X2) = C+ if and only if X, and X; are independent; 


(iii) C (X1, X3) = C* if and only if there exists a non-decreasing function f such 
that we have Xə = f (X1). 


2. We note U; = 1 — exp(—A7) and Uz = LGD. 


(a) The dependence between 7 and LGD is maximum when we have C (T, LGD) = 
Ct. Since we have U, = Uz, we conclude that: 


LGD +e-*7 —1=0 
(b) We know that: 
p (T, LGD) € [Pmin (T, LGD) ; Pmax (T, LGDJ| 


where Pmin (T, LGD) (resp. Pmax (T, LGD)) is the linear correlation corresponding 
to the copula C^ (resp. C^). It comes that: 


=)=} 
and: 
: 1 
LGD] = 5 
1 
o (LGD) = 19 


In the case C (T, LGD) = C7, we have U, = 1— Us. It follows that LGD = e~?7. 
We have: 


[r LGD] = E[re ^] 

= J te-™ Xe7?t dt 
0 

= J the? dt 
0 


= |- 2 oe E 1 n E "A 
MEE, 


We deduce that: 


Pmin (T, LGD) 


Il 
e REN. 
EH 
S 
D a 
ATN 
ime 
I= 
Ses et 
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In the case C (T, LGD) = Ct, we have LGD = 1 — e~>7. We have: 
Eee 7] 


= j t (1 — gn) Ae dt 
0 
f Deo dt — n fAe- dt 
0 0 
_ tee + eta) — 1 
0 0 4A 


Pp 
X pe. D 


z [r LGD] 


I 


Il 
o 


We deduce that: 


II 


Pmax (T, LGD) 


(a 3)/ G a) 


We finally obtain the following result: 


lptr.LGp) < Y 


(c) We notice that |p (T, LGD)| is lower than 86.696, implying that the bounds —1 
and +1 can not be reached. 


(a) If the copula function of (7,72) is the Fréchet upper bound copula, Tı and T2 
are comonotone. We deduce that: 


U, = Ug 4 1-e VT = 1] — ee 


and: 


(b) We have U; = 1 — Up. It follows that S1 (r1) = 1 — S2 (72). We deduce that: 
e ^n mE 1 = e ^72 
and: 
-In (1 — em) 
Ay 


There exists then a function f such that 7 = f (72) with: 


Ti = 


—In (1— e?) 


f= 


(c) Using Question 2(b), we known that p € [Pmin, max] Where Pmin and pmax are 
the correlations of (71, T2) when the copula function is respectively C^ and C*. 
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We also know that p = 1 (resp. p = —1) if there exists a linear and increasing 
(resp. decreasing) function f such that 7, = f (T2). When the copula is C+, we 
have f (t) = xt and f'(t) = E > 0. As it is a linear and increasing function, 
we deduce that pmax = 1. When the copula is C^, we have: 


—In (1— e772") 


f= 


and: 
Age ^?! In (1 — gc 


f'(t- so (eer = 7 0 


The function f (t) is decreasing, but it is not linear. We deduce that pmin 4 —1 
and: 
-l<p<l 


When the copula is C^, we know that there exists a decreasing function f such 
that X2 = f (Xi). We also know that the linear correlation reaches the lower 
bound —1 if the function f is linear: 


X» =a + bX, 
This implies that b < 0. When X takes the value +00, we obtain: 
X2 =a+bx oo 


As the lower bound of Xə is equal to zero 0, we deduce that a = +00. This 
means that the function f(x) = a+ bx does not exist. We conclude that the 
lower bound p — —1 can not be reached. 


Xı + Xə is a Gaussian random variable because it is a linear combination of the 
Gaussian random vector (X1, X2). We have: 


S [Xi + Xe] = 1n + be 


and: 
var (X + X5) = 0? + 200402 + 02 


We deduce that: 
Xi + Xo vN (ua + 12,02 + 2p0102 + oj) 


We have: 
COV (Yi, Y2) = J [Y3 Y3] -E [Y>] y [Y2] 
= E [e™:+*] — E [Y2] E [Y2] 
We know that e¥1+%2 is a lognormal random variable. We deduce that: 


1 
J [entre] = exp ( V [X1 + Xo] + 2 var (Xı + xj) 


I 


1 
exp (m + H2 + z (ei + 200102 + b) 
= git $71 ga $03 opoio2 


We finally obtain: 


COV (Xi, Y3) = elit $91 oa 305 (epe 192 = 1) 
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(c) We have: 


1.2 1.2 
ei t3?1 eH2t*2393 (e071 92 i) 1) 


yenit (e*t — 1) ertet (ev? — 1) 


c^7102 =] 
Ver —1Ve% —1 


(d) p (Y1, Y2) is an increasing function with respect to p. We deduce that: 


p (Yi, Y2) 


p (Yi, Y) = 1 4> p = 1 and e1 = 02 


The lower bound of p (Y1, Y2) is reached if p is equal to —1. In this case, we have: 


goma 


= > 
VE — 1ye% —1 


1 


p (Y1, Ya) 


It follows that p (Y1, Y) Z —1. 
(e) It is evident that: 


e9?122t —1 
Vert = 1 esit S 


In the case 91 = o2 and p = 1, we have p (Sj (t) ,S3 (t) = 1. Otherwise, we 
obtain: 


p (S1 (t) , S2 (t) 


Jim p (Si (t), S2 (0) = 0 
(f) In the case of lognormal random variables, the linear correlation does not neces- 


sarily range between —1 and +1. 


11.4.7 The bivariate Pareto copula 
1. We have: 


F, (x1) = Pr [Xi < zı} 
= Pr{X; < z1, X2 < co} 
= F(z,o) 


We deduce that: 


Fy (xı) 


Il 
= 
E os 

D 

S 
S| + 
8 

Ex 
NAI 
| 
R 
oN 
D 

N 
$c 
NILUM 
| 
R 

+ 


(Apa. ae 1)" 


Il 
LA 
| 
P ara 
D 
Hir 
$e 
8 
es 
Nu ou 


We conclude that F; (and F2) is a Pareto distribution. 


2. We have: 
C (u1, ug) = F (F1! (ui) F3" (u2)) 
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It follows that: 


We deduce that: 


C (u1,u2) = 1-—(l—wu,)-—(1—u2)4 


3. We have: 
OC (u1, U2) "e —1/a —1/a TA 
ya o 1-a ((1-u) Soles) -1) x 
(-i) Gaal) 
a 
-—a-l1 
= 1- (a - u) "^ + (1 = ug) V7 - 1) x 


(1 _ wor 


We deduce that the probability density function of the copula is!: 


c(u1, u2) = 


(1 — ui — us + urug) 97d 


In Figure 11.2, we have reported the density of the Pareto copula when a is equal to 
1 and 10. 


1 Another expression of c (u1, u2) is: 


unda: vts (222) CEP 
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Tail dependence 
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—— Upper 
0.6 aem Lower 


4. We have: 
Pain ou 
u-0t u 
=2 lim PCl) 
u—0t Qui 
=2 lim 1- (a u) eu Va 1) = (Les ui 
u— 0+ 
= 2 li 1—1 
mee 
=0 
and: 
dw — lm 1 — 2u + C (u, u) 
u—l- l-u 
(a u) Wert u) ^ 1) 
— lim 
u1- l—u 
= A _ u l/a = 
= lim (1+1-(1-w)¥*) 
= 9€ 


The tail dependence coefficients A^ and A* are given with respect to the parameter 
a in Figure 11.2. We deduce that the bivariate Pareto copula function has no lower 
tail dependence (A^ = 0), but an upper tail dependence (A* = 27%). 


5. The bivariate Pareto copula family cannot reach C^ because A^ is never equal to 1. 
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We notice that: 


lim At =0 
a->oo 

and 
lim At =1 
a0 


This implies that the bivariate Pareto copula may reach C+ and Ct for these two 
limit cases: à — oo and a — 0. In fact, œ — 0 does not correspond to the copula Ct 
because A^ is always equal to 0. 


6. (a) We note U; = F1(Xi) and Ug = Fə (X2). X1 and X» are comonotonic if and 
only if: 
U =U, 


We deduce that: 


is 05 -- X9N ” jus 05+ X,\ ^. 
05 i 61 
05 + Xo | 04 4- X4 EH 
02 = 0 
01/02 
m x -e ( (45) -1) 
1 


We know that o (X4, X2) = 1 if and only if there is an increasing linear relation- 
ship between X, and X». This implies that: 


o1 
T dle 
ag 


(b) X; and X» are countermonotonic if and only if: 


Uz=1-U, 


05 + X5 aoa 0; +X, uu: 
05 ud 61 
05-- X3N 7? 007 Xi1N 7 
—1- 
m ( 02 ) ( 0 ) 
1/ag 
Ary ma 
-rafa 
1 


It is not possible to obtain a decreasing linear function between X; and X2. This 
implies that p (X1, X2) > —1. 


(c) We have: 


We deduce that: 


F’(21,%2) = OC(Fi(zi),Fo(22)) 
btr ™ 02+ 22\ ^7 
= 1 
( 01 ) ( 02 ) < 
Oit oe 05 + Lo ldem 7 
0 0» 
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The traditional bivariate Pareto distribution F(ri,r2) is a special case of 
F’ (x1, £2) when: 
Q1 = Q2 = Q 


Using F' instead of F, we can control the tail dependence, but also the univariate 
tail index of the two margins. 


Chapter 12 


Extreme Value Theory 


12.4.1 Uniform order statistics 
1. Since we have f (x) = 1 and F (x) = z, we deduce that: 


n! 1 hag n-l. 


a I'(n-4 1) i-1 n—i 
7 WUPEIu 629 


This is the probability density function of the Beta distribution B (a, 8) where a = i 
and 8 —mn-—3i-4 1. 


2. We have: 


3. We have: 
var (Xin) = var(B(üin-i-4-1)) 
ap 
(a+ 6)’ (a+ +1) 
i(n—i-41) 
(n D (n2) 


4. We have: 


Sample E 3 4 5 6 7 8 


0.04 0.14 0.24 0.34 0.45 0.55 0.72 0.94 
0.12 0.25 0.31 0.32 0.57 0.64 0.69 0.97 
0.11 0.17 0.17 0.26 0.50 0.50 0.69 0.85 
0.00 0.03 0.15 0.53 0.58 0.77 0.98 0.98 
0.15 0.25 0.46 0.62 0.65 0.74 0.85 0.89 
0.05 0.07 0.15 0.25 0.65 0.74 0.86 0.93 
0.12 0.16 0.33 0.34 0.55 0.61 0.63 0.95 
0.01 0.11 0.14 0.47 0.57 0.82 0.87 0.96 
0.27 0.55 0.57 0.68 0.73 0.78 0.83 0.85 
10 0.28 0.40 0.68 0.89 0.91 0.94 0.99 0.99 


oo-10» 0:4 C5 l2 


e 


The empirical and theoretical mean and standard deviation of X; are reported in 
'Table 12.1. 
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TABLE 12.1: Empirical and theoretical mean and standard deviation of X;.s 


Xi:g V[Xas] (Xin) d (Xen) 
0.1150 0.1111 0.0981 0.0994 
0.2130 0.2222 0.1584 0.1315 
0.3200 0.3333 0.1918 0.1491 
0.4700 0.4444 0.2096 0.1571 
0.6160 0.5556 0.1302 0.1571 
0.7090 0.6667 0.1333 0.1491 
0.8110 0.7778 0.1241 0.1315 
0.9310 0.8889 0.0511 0.0994 


oo-1o0» 0t cor. n|- 


. We reiterate that Xin ~ B (i,n — i+ 1). We deduce that the median statistic follows 


a symmetric Beta distribution: 
Xkpin ^v B(k+1,k+1) 


Moreover, we have: 


Xin ~ B (i, 2k — i) 


It follows that the density function of Xi:n is right asymmetric if à < k, symmetric 
about .5 if i = k + 1 and left asymmetric otherwise. 


. We consider the change of variable: U = F(X). It follows that U follows a uniform 


distribution. Using the previous results, we can deduce that the density function of 
Ui:n is right asymmetric if i < k, symmetric about .5 if i = k + 1 and left asymmetric 
otherwise. Because F (x) is a symmetric function about z* = F^! (0.5), we conclude 
that the density function of X;., is right asymmetric if i < k, symmetric about z* if 
i = k + 1 and left asymmetric otherwise. 


12.4.2 Order statistics and return period 


1. We have: 
F,,(r) = Prímax(X;,...,X4) € x] 
= Pr(X4 € z,..., X. € x] 
= [[Prtxizs) 
i=1 


E 


2. The density function of Xn:n is equal to: 


fanin (£) = Os Fax) 
- H(A 


We deduce that the log-likelihood function of a sample (z1,...,2,,) of the order 
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statistic Xņn:n is equal to: 


iy ua. 
Lain = mln — + In (27) T no? DD (* r) 


(n-1)nð (==) 


For each time period n, we calculate n:n and find the estimates fin:n and 65.4. Then 
we test the joint hypothesis: 


|] fan = [22—Ífa3-—...—nu 
Ho = A A. n 
01:1 02:2 03:3 toile oO 


3. The return period is the average period between two consecutive events. It is equal 
to: 


sd 
p 


where p is the occurrence probability of the event and n is the unit period measured 


in days. We have: 
1 
T (Fan (@)) = xn 


"^ l-a 


We deduce that the return periods are respectively equal to 100, 100, 500 and 2 200 
days. 


4. We would like to find the value a that satisfies the following equation: 
T (Ezz (a) = T (F^ (99.9%)) 
We have: 


We deduce that: 
a = 1 — 20 x 0.001 = 98% 


12.4.3 Extreme order statistics of exponential random variables 
1. Using the Bayes formula, we have: 


Pr{r>tf\r >s} 


Pr{7>t|7>s} 


Pr {7 > s} 

- Br 
| Prír» s) 

S (t) 
~ S(s) 

eX 

es 
= gMt-s) 
= Pr{r>t-—s} 


This implies that the survival function does not depend on the initial time. This 
Markov property is especially useful in credit models, because the default time of the 
counterparty does not on the past history, for instance the age of the company. 
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2. We have: 
Pr {min(71,...,7) >t} = Pr {mi > £ ss m > t) 
EE Į [Pr {7; >t} 
i=1 
= exp (- 5 v) 
i=1 
and: 


Pr {max (71,...,7) Xt] = Pr {mi exe et] 


I 
Es 


Pr {7 zt) 


©. 
Il 
m 


(1 = gue 


Il 


" 
Il 
un 


We deduce that: 
min (1,...,74) ~ E (>: x) 
i=l 


The distribution of max (71,..., T4) is not a known probability distribution. Let us 
consider the case n = 2. We have: 


Pr {min (T1, T2) = T2} = Pr{tm%<7,} 


[ ria Aye" MEL p Nata dt, dts 
[a Aie Mh D. AÀ2e A2te ats) dt, 


= j yew (1 = poe) dt 
0 


i, Aie M dti -a f e- ArtAz)t dti 
0 0 


II 


We can generalize this result to the case n > 2 and we finally obtain: 


; ^i 
Pr [min (Tı, — Tn) = Ti} = DT 


3. When 7; and 7; are comonotone, we have S, (71) = S; (7i). It follows that: 


At 
TS Fal 


^i 
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We note At = max (A1,..., Àn) and A^ = min (A1,..., An). We deduce that: 


i UN : Ar, > t 
TOM. Se es 
dp hee ae 1:61 2 


Pr (min(rj,...,7,) >t} = Pr 


and: 


Pr {max (,...,7,) € t) 


Il 
"y 
= 
T 
a 
IA 
c 
ppm, 


Í 
T 
| 
© 
ia 
'a 
SIS 
> 
= 


We finally obtain: 
min (1,...,7T4) YE (A*) 


and: 
max (71,..., Tn) YE (A7) 


12.4.4 Extreme value theory in the bivariate case 


1. An extreme value copula C satisfies the following relationship: 
C (uj, u$) = C (u1, u2) 
for all t > 0. 
2. The product copula C+ is an EV copula because we have: 


C^(uuj) = ujus 


For the copula Ct, we obtain: 


E M NE T 
C (ui, ul) = min (uj, u$) 
t . 
_ ui if ur < ue 
ub otherwise 


However, the EV property does not hold for the Fréchet lower bound copula C^: 


C^ (uj, u$) = max (uj + u$ — 1,0) # max (uy + u2 — 1,0)! 
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Indeed, we have C^ (0.5, 0.8) = max (0.5 + 0.8 — 1,0) = 0.3 and: 


C- (0.57,0.87) = max (0.25 + 0.64 — 1,0) 
= 0 
ses 0.3? 
3. We have: 
0 9311/0 
C(u,uj) = exp - [Cn ut) + (7 mu] ) 


= (elma mn 
= ou (u1, u2) 
4. The upper tail dependence A is defined as follows: 


Ja 1 — 2u + C (u1, u2) 


uit l1—u 


It measures the probability to have an extreme in one direction knowing that we have 
already an extreme in the other direction. If A is equal to 0, extremes are independent 
and the EV copula is the product copula C+. If A is equal to 1, extremes are comono- 
tonic and the EV copula is the Fréchet upper bound copula Ct. Moreover, the upper 
tail dependence of the copula between the random variables is equal to the upper tail 
dependence of the copula between the extremes. 


5. Using L'Hospital's rule, we have: 


1 —2u 4 e (my c m]? 


A = I 
i Ica 
1 — 2u + e PO mw] 
= lim 
ult l—u 
1 — 2u + we 
= lim 
ult 1—u 
Q— 24 21/0,2"*-1 
— lim 
u—1t —1 
= lim, 2-2V/9,2*-1 
xr 
= 9 — 91/0 


If 0 is equal to 1, we obtain A = 0. It comes that the EV copula is the product 
copula. Extremes are then not correlated. This result is not surprising because the 
Gumbel-Houggard copula is equal to the product copula when 0 = 1: 


-[C In ui)!4(— In uz)!]* 


e = U1U2 = ct (u1, u2) 


6. 
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(a) We have: 
C(uiad) — aj MIC main (ul 0) 
(a) (^) (mn Qr) 
- (utut (usaf) 
C 


(u1, u2) 


1 — 2u + ul-%1y1-% min (uf: , uf 
A = lim ( ) 
ult l—u 
1 — 2u + ul 81 yt 92 yt 
= lim 
ult l—u 
i 1 — 2u + u2-% 
= lim ————— _ 
ult l—u 
` 0 — 2 + (2 — b2) ut? 
= lim 
ult —1 
= lim 2—2u'- + zu!’ 
u—lt 


If 05 > 0,1, we have \ = 041. We deduce that the upper tail dependence of the 
Marshall-Olkin copula is min (01, 02). 

(c) If 0; = 0 or 02 = 0, we obtain \ = 0. It comes that the copula of the extremes is 
the product copula. Extremes are then not correlated. 


(d) Two extremes are perfectly correlated when we have 01 = 05 = 1. In this case, 
we obtain: 
C (u1, ug) = min (u1, v2) = C* (u1, u2) 


12.4.5 Max-domain of attraction in the bivariate case 


1. 


Let (X1, X25) be a bivariate random variable whose probability distribution is: 
F (21,22) = C; x, x, (Fi (21) , F2 (22)) 

We know that the corresponding EV probability distribution is: 
G (21,22) = C(x, x, (G1 (21) , Ga (v2) 


where G4 and Gg are the two univariate EV probability distributions and Cry, X3) 
is the EV copula associated to C(x,,x,. 


(a) We deduce that: 


G (21, 22) C^ (Gy (21) , Ga (22)) 
mE A (21) V4 (a2 — 1) 


= exp (7e ^ + 22 — 1) 
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(b) We have: 
G(z,z) = A(x) ®, (+=) 
= exp (2n (LES : 
(c) We have: 
G(a1,%2) = Wi(zxi—1)9, (1 + =) 


= exp (n-i- 6-27) 
a 


2. We know that the upper tail dependence is equal to zero for the Normal copula when 
p « 1. We deduce that the EV copula is the product copula. We then obtain the same 
results as previously. 


3. When the parameter p is equal to 1, the Normal copula is the Fréchet upper bound 
copula C*, which is an EV copula. We deduce the following results: 


G (z1, £2) = min (A (21) , V1 (x2 — 1)) 
= min (exp (—e ^?!) , exp (£2 — 1)) (a) 


2) 


Q 
—— 
8 
m 
8 
t 
— 
Il 
E 
B 
Lum 
> 
P stus 
8 
e 
e 
Q 
= ASN 


din c (ks — 1), exp (- (1+ 2)7) (c) 


4. In the previous exercise, we have shown that the Gumbel-Houggard copula is an EV 
copula. 


(a) We deduce that: 


G(a1,22) = e-[C nA C 1n tae] 


T" (- fe“ + (1 - 2 i Ü 


(b) We obtain: 


Güngg = e [C mA- mn e (1 22))*] 7 
1/0 
i E “Hs T2 —ad 
= on r +(1+ 2) | 
(c) We have: 
Gears) = e [C Penh mese)" 


Chapter 13 


Monte Carlo Simulation Methods 


13.4.1 Simulating random numbers using the inversion method 


1. Let u; be a uniform random variate. 


(a) We have seen that the quantile function of the distribution function GEV (u, c, €) 
has the following expression: 


G-t (a) =p- t (1 —(- Ina) 5) 


It follows that: 


zti} u z(t ( Inu) *) 


(b) The cumulative density function of the log-normal distribution £N (u, 0°) is 


equal to: 
F(z)- 9 (==) 


o 


We deduce that: 
F^! (u) = exp (u + o^ (u)) 


To simulate a log-normal random variate, we then use the following algorithm: 


£i < exp (u +007! (u;)) 


(c) We have: 
1 
PO) = lay? 
and: m 
F(u)-a (+) 


To simulate a log-logistic random variate ££ (a, 8), we use the following trans- 
formation: 
GA) 
Ti — Q 
1— Ui 


2. (a) Let z; be a random variate simulated from the probability distribution of X. A 
straightforward algorithm is to keep all the random variates z;'s that are higher 
than the threshold H: 


l; — 


a missing value otherwise 
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(b) We have: 
Fz (x) = Pr{X <a|X>H} 
o Pr{X<z,X >H} 
, Pr{X > H} 
_ Fx (#)—Fx (H) 
bP GH) 
(c) We have: 


It follows that: 
Fy (u) = Fx’ (u + Fx (H) (1 — u)) 


We deduce that the algorithm to simulate the random variate l; is: 


li € Fx! (ui + Fx (A) (1 — ui) 


Pss 
e 


Concerning the first algorithm, we simulate nx values of X, but we only kept on 
average np = nx (1 Fx (H)) values of L, meaning that the acceptance ratio 
is equal to 1 — Fy (H). For the second algorithm, all the simulated values of u; 
are kept. For instance, if F x (H) is equal to 9096 and we would like to simulate 
one million of random numbers for L, we have to simulate approximatively 10 
millions of random numbers in the first algorithm, that is 10 more times than 
for the second algorithm. In this case, the acceptance ratio is only equal to 10%. 


When X follows a log-normal distribution LM (u, o°), Algorithm (a) becomes: 


— 
[o] 
— 


l; — exp (u +o67! (u;)) 


with the condition u; > Fx (H) = 95.16%. For Algorithm (b), we have: 


l; — exp (1+ pm (u EUR i) 


In Figure 13.1, we have represented the random numbers l; generated with the 
two algorithms. We observe that only 4 simulated values are higher than H in the 
case of Algorithm (a). With Algorithm (c), all the simulated values are higher 
than H and it is easier to simulate a random loss located in the distribution tail. 


3. (a) Let x; be a simulated value of X;.We have: 
Lin = min(2,..., En) 


and: 


(b) We have: 
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x 108 


0.8 * Algorithm (a) 
* Algorithm (c) 


0.7 H 


Simulated losses 
io) 
Cn 
T 
+ 


0.34 E 


* 
0.1 F m "MEL id 


5 
e 
m 

a lama maa sem oo | eoo °°) J 


0.0 0.2 0.4 0.6 0.8 1.0 


Uniform random variates 


FIGURE 13.1: Simulation of conditional losses L = X | X > H 


d Fr (u) = F! (a-a—u)/") 


lin 


We deduce that a simulated value z; of X, is given by: 
ay eF! (1 ie uu) 
For the maximum order statistic Xn:n, we have F1:n (x) = F(x)” and: 
cl +F! (ui") 


(c) In Figure 13.2, we report 1000 simulated values of X1.59 and X50:59 when X; ~ 
N (0, 1). 


13.4.2 Simulating random numbers using the transformation method 
1. The density function of Y = h (X) is given by the following relationship: 


dx 


g (y) = f (x) 2m 


We obtain: 
OQ ,,—a—1l5—/z 
gy) = See 
Bry ote -B/e 
T (a) 
payit 
T (a) 
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+ 
E ++ me $ E (s pi 
+ + + 
na up dy, AE ram att 
Rus tte # pr (EE d En Hm 
AND NEL ESAE 
5 
oe odd qu uide 
PUT EEE E + + 
T +t + ues FAT g 


+ 
* i BáGRS 4 
Netus hes 
"E update E m 
US REO, S OE 


uds PT en [i RT RUE 
DRE. CAMS eee $ 

F E At +e d * + dte 
AFER LUE SI e 

d a qh o CHE T ee 

ipu + = i + 

A E 

L L i 1 L L J 
400 600 800 1000 


FIGURE 13.2: Simulation of X 1:50 and X 50:50 when Xj at N (0, 1) 


It follows that Y ~ G (a, 8). To simulate X, we draw a gamma random variate Y and 
set X =1/Y. 


2. (a) The density function of X ~ G (a, B) is equal to: 
B*g*-lg-Pe 


In the case a = 1, we obtain: 
f(x) = 8e" 
This is the density function of E (8). To simulate X, we apply the following 
transformation: 
= Inu 
quium 
B 


where u is a uniform random number. 


(b) We know that: 
9 (n, B) = dG (1, 8) 
j=l 
G(n,B) - 3, E 
i=1 


where E; ~ E (8) are iid exponential random variables. We deduce that the 
probability distribution G (n, 8) can be simulated by: 


se cg 


We deduce that: 


— 


Ww 


— 
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Or: 
TL 
1 
x 4+ ——1ln II Uy 
B i=1 
where uj,..., Un are tid uniform random variates. 


Let Y ~ G (0,9) and Z ~ G (8,6) be two independent gamma-distributed ran- 
dom variables. We have: 


xui _ RTE EE 
T T (a)T (8) 
We note: 5 
X — 
Y+Z 
and: 
S=Y+Z 


It follows that Y = XS and Z = (1 — X) S. The Jacobian of (y, z) = ọ (a, s) is 


then equal to: 
E x 
Jo = ( —s l-r ) 


Since we have det J, = s, we deduce that: 


I 


fv.z (y, 2) x Is] 
óc 8 


T (a) (8) 


Uer 0-9): 


( jure d) 
T (a+ 8) 
= fx (2) fs (s) 
It follows that the random variables X and S are independent, X ~ B (o, 8) and 
S c G (a -- gB, ð). 


'To simulate a beta-distributed random variate, we consider the following trans- 
formation: 


fÍx,s (x, 8) 


I 


y 
ytz 


zc 


where y and z are two independent random variates from G (a, ô) and G (8,9). 
We remind that: 
Ixy (2, y) = fro (r.6) | 


det Jy 
where J, is the Jacobian associated to the change of variables (x, y) = ọ (r, 0). 
We have: 
J = cosÜ —rsinð 
? \ sinf  rcos0 
and: 


det Jj = r cos? 0 + rsin? 0 =r 


256 Handbook of Financial Risk Management 


Since R and © are independent, we have fno (7,9) = fr (r) fo (0). Moreover, © 
is a uniform random variable and we have: 


1 
fo (9) = = 
We deduce that: fa (n) 
" 
[xv (a, y) = 2 
TT 
We also notice that: 
X? +Y? = RB?cog?0  R?sin?O 
R? 


Finally, we obtain the following result: 


fr (v r? + "3 
q,y) = ——————— 
[x ( y) 2n /22 + y? 
Concerning the density function of X, we have: 
co fn (v x? + y?) 
T= — d 
fe@= faa 


(b) We assume that R = /2E where E ~ € (1). 
i. We have: 


Fr(r) = Pr(V2E <r} 
E 


We deduce that: 


ii. We have: 


-— 2m 
-s?[2 qoo o-y*[2 

7 V2T Jo V2 E 
ene /2 
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We deduce that X ~ N (0,1). By symmetry, we also have Y ~ N (0,1). 
Moreover, we notice that X and Y are independent: 


fx,y (x,y) 


eG) 


= fx (x) fy (y) 


iii. We have R = /2E = /—2In U, and © = 27U2 where U and U2 are two 
standard uniform random variables. It follows that X and Y defined by: 


{ X = ,/-21nU; cos (27U2) 


Y = /-21n Ui sin (27U2) 


are two independent standard Gaussian random variables. 


r2 —v/2 
Fg (r) =1- (1+5) 


i. It follows that the density function of R is equal to: 


(c) We assume that: 


"m —v/2—1 
fr(r) =r(1+5) 


ii. We deduce that the joint density of (X,Y) is: 


iii. We notice that fx y (x, y) is an even function of y. We deduce that: 


+œ | 
— |1 
[oss 


fx (x) 


fx,y (x,y) 


II 


r r2 —v/2—1 
e Ds 
2mr ( a -) 
= 1 1+ 
On 


2 2 —v/2—1 
T +y ) dy 


[ 


Hoo 1 2 2 —v/2—1 
(Or) m) 
T V v az? 


We consider the following change of variable: 


We have: 


and: 


2 
= 1 (vx ) du 
2 u2 /(u-1 — 1) (v + 2?) 
1 Vv+2? 
= —————— du 


 2u?yu! — 1 
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vi. 
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We obtain: 
o 1 L1 Wore 
fx (x) = 2 14 2 =I 
1 T v u u^ Nu = 
1 2 v/2—1 2 
1 1 
E A. pe: l DENS diu 
o 2m vju u? V u-1-—1 
2\ —-(71)/2 p1 
= (i+ E ^a- 97^ du 
27 V 0 
5 2\ —(¥+1)/2 
= pail vv A 
2. ^27/-2m- v 
We have: 
p(t Nae _ TEOG 
2-9 2r r(¥+1) 27 
_ TF) vw 
ar (3) 2m 
+1 


We finally deduce that: 


- T) 
bc 


( ^ E 
1+ 
V 


This is the probability density function of the t, random variable. 


Fz! (u) = " (a zu i 1) 


We deduce that the random variate r; can be simulated using the inversion 


method: 
ri " (a — uj)?!” — 1) 


where u; is a uniform random variate. 


iv. We have: 


. It follows that: 


X= y” (a caps 1) cos (21 Us) 
Y- y” (a =y wes 1) sin (21U3) 


where U4 and Us are two independent uniform random variables. 


In the Box-Muller algorithm, X and Y are independent. In the Bailey algo- 
rithm, this property is not satisfied because: 


fxy(z,y) 4 fx(x)fv(v) 
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13.4.8 Simulating random numbers using rejection sampling 


(a) It follows that: 


m 


NS 


y= te 
get (1 — qj 
88 (a, B) 


We deduce that: 


k' (£) = 


and: 


h'(z)=0 € (a-1)(1-—2)+(8-1)r=0 
" o—1 
m dim o 


The supremum of h (x) is equal to: 


ES a-1 Y*'!f B-1 V3 
Mel gam lara—a) (ara=a) 


We deduce that: 


Mermen (8-107 
T (o)T (8) (a + 8 - 2) 


We have reported the functions f (x) and cg (x) in Figure 13.3. c takes the value 
1.27, 1.78, 8.00 and 2.76. The acceptance ratio is minimum in the third case 
when a = 1 and £8 = 8. In fact, it corresponds to the worst situation for the 
acceptance-rejection algorithm. Indeed, when one parameter is equal to 1, we 
obtain: 


e ESRB) Ur n 
T ())T (8) (6 - 1*7 


'The acceptance ratio p tends to zero when the second parameter tends to infinity: 


lim p= lim —-=0 
Boo Boo C 


'The acceptance-rejection algorithm becomes: 


i. Generate two independent uniform random variates uj and us; 
ii. Calculate v such that: 
B5 
(Mp a 


v= u — u f= 
aae 


iii. If u < v, accept u1; otherwise, reject u1. 
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a= 1.5, 8 = 1.5 a = 3.0, $ = 2.0 
1.5 2.0 
1.5 
1.0 
1.0 
0.5 
0.5 
0.0 0.0 ze 
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 0 
x x 
a = 1.0, 8 = 80 a = 50,8 = 7.0 
10 3.0 
8 2.5 
2.0 | 
ê ææ f(x) — Beta 
1.5 ener c:g(x) — Uniform 
‘ 1.0 Y 
2 0.5 N 
N 
0 0.0 Wald i i 2 
0.0 0.0 0.2 0.4 0.6 0.8 0 
x x 


FIGURE 13.3: Rejection sampling applied to the beta distribution 


2. (a) It follows that: 


(1— g^! 
0538 (a, B) 


Its maximum is reached at point x* = 0. We deduce that: 


1 


C= =a o 


a8 (a, B) 
(b) We have G (x) = x^. We use the inversion method to simulate X: 
zc ue 
where u is a uniform random variate. 
(c) The acceptance-rejection algorithm becomes: 
i. Generate two independent uniform random variates uj and ug; 


ii. Calculate x = ul/^; 


iii. Calculate v such that: 


iv. If ug € v, accept x; otherwise, reject x. 


3. 
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(a) We have: 
G (z) E sel dt 


If x € 0, we obtain: 


If x > 0, we obtain: 


We deduce that: 


and: 


'To simulate the Laplace distribution, we consider the following transformation: 


1 1 
re-sign (u— 5) im (1-2 «-gl) 


where u is a uniform random variate. 


(b) We have: 
"EE 
g (x) 
2 
= "EL 
T 
We have: )A (2) 
) = J —-(z-1)h(z) ifz«0 
m { —(o—1) Bh. dx 
There are two maxima: z* = +1. We deduce that: 


c = max(h(—1),h(1)) 


2 
= [ze 
T. 


~ 1.32 


The functions f (x) and cg (x) are reported in Figure 13.4. 
(c) The acceptance-rejection algorithm becomes: 


i. Generate two independent uniform random variates uj and u2; 
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Oe 
0.6 F 
= — f(x) — Normal 
===: ceg(x) — Laplace 
0.5 F 
0.4 F 


| 
"| / í 
0.2 | / \ 


0.1 F 


so breraa ad i op 4b oup ge = Minona 


FIGURE 13.4: Rejection sampling applied to the normal distribution 


ii. Calculate x = sign (ui — 0.5) In (1 — 2 |u — 0.5|); 
iii. Calculate v such that: 


iv. If ug € v, accept x; otherwise, reject x. 


4. (a) We have: 


h(a) = 


E prt Hat) e * 


= T (c7? lng , elt) E et 


We deduce that: 


h(a) < Nt (efor) lng a e(t a) eo 


T (o) 
2m 
= (a+1)lInz ,-a 
T (o) € € 
2n o-41,—cz 


Monte Carlo Simulation Methods 263 
We have: , 
(z**e7*) = ((a- 1) - z) z^e7* 
The maximum is reached at the point z* = a+ 1. We deduce that: 


27 
_ De ec (at) 
om 5 (a) (a+ 1) e 


(b) We have: 


|. T(3/2) z2 372 
so = samt) 


1 42V ~3/? 
e) 
= (pare 


and: 


G(x) = i (20) 77 at 


t x 
B lu]. 
- um) 
v 2 2+ zx? 


We calculate the inverse function G ^! (u): 


1 £ r? 2 
2 pee = = (2u—1 
5 (1+ a] u 24x2 (2u ) 
e r? =2(2u—1)? +2? (2u — 1? 
e a?—2(2u— 1) +2? (2u — 1 
zx 2  2Qu-1y 
(4u? — 4u) 
2 (u — 0.5 
aj tae) 
u2—u 


It follows that we can simulate the Student ¢ distribution with 2 degrees of 
freedom by using the following transformation: 


V2 (u — 0.5) 


2 


xc 


u^ —u 


(c) In Figure 13.5, we show the acceptance ratio p — 1/c for the two algorithms. It is 
obvious that algorithm (b) dominates algorithm (a). In particular, the acceptance 
ratio tends to 0 when a tends to infinity when we use the Cauchy distribution 
as the proposal distribution. 


(a) We have: 


264 


Handbook of Financial Risk Management 


0.9 F ——— Cauchy 
===. Student 


FIGURE 13.5: Acceptance ratio for the Gamma distribution 


(b) We obtain c = 5 x 40% = 2. Therefore, the acceptance ratio is equal to 50% and 
we reject one simulation in two. This is confirmed by Figure 13.6, which shows 
the number of accepted and rejected values. However, the acceptance ratio is not 
the same for each states. For instance, it is equal to 100% for the state, which 
has the highest probability, but it can be low for states with small probabilities. 
In our experiment, we obtain the following results: 


falk) falk) frik) fm(Kk) 
49% 9.7% 16.1% 32.5% 
9.9% 19.6% 8.5% 17.1% 
19.6% 38.9% 0.0% 0.096 
10.2% 20.2% 9.2% 18.5% 
5 — 5.8% 11.596 15.8% 31.9% 
sum 50.4% 100.006 49.6% 100.0% 


me wN |Z 


where f4 (k) and fr (k) are the frequencies of accepted and rejected values, and 
fA(k) and fý (k) are the normalized frequencies. We have rejected 49.6% of 
simulated values on average. Among these rejected values, 32.5% comes from the 
first state, 17.1% from the second state, etc. We also verify that the empirical 
frequencies f} (k) are close to the theoretical probabilities p (k). 
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80 Rejected 


0 1:3 3.3 5.6 8.9 


FIGURE 13.6: Histogram of accepted and rejected values 


13.4.4 Simulation of Archimedean copulas 


1. Let f be a function. We note y = f (x). We have dy = 0, f (x) dz, x = f^! (y) and 
dz = 0, f! (y) dy. We deduce that: 


3y f (u) = 


9; f (£7 (y) 
We then obtain the conditional copula function: 
yg’ (u1) 
€! (e! (e (u1) € (uz))) 


Let vı and v2 be two independent uniform random variates. The simulation algorithm 
based on the conditional distribution is: 


Coj (us | u1) = 


{ Uy = U1 
Co (ug | u1) = v2 
We deduce that: 


Uy = U1 
uz = p`! (e (o= (<22)) -9 (i) 
'This is the Genest-MacKay algorithm. 


2. We obtain the Gumbel-Hougaard copula: 


C (u1, ua) = exp (- k- Inu)’ +(— ln uy 7) 
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3. Using the Gumbel-Hougaard copula, we have ọ (u) = (— In u)’, 7! (u) = exp (—ul/®) 


and (u) = —0u-!(—Inu)?-!. However, it is not possible to obtain an explicit 
formula for v^! (u). This is why we use a numerical solution 7 (u) for q^! (u). Finally, 
we obtain the following simulation algorithm: 


uj = Ui 


ae [- [Ce n] 


. We have: 


e 9u (e 9v: m 1) 
(77-1) € (709 T1) et 71) 


Coj (u2 | u1) = 01 C (u1, u2) = 


We deduce that: 


Coj (u2 | uy) =v 


Ss e041 e— Gua m e 9" = ve ? x ve 9" +v (e 99s _ 1) e 0u2 


-0 — 
uy = ln c is (c 1) ) 


va + (1 — v3) e- 9 


. We have: 
Q(u) =v 
1—-0(1— 
€ ln ( u) E 
u 
= 1—80(1—-u)-ue" 
E 1-6 
dE EU re, 
It follows that: 
1-6 
zu 
9€ (eQu)te(u)) = 
exp (m 1-60) + In ted) 0 
(1 — 6) "u1u» 


'The denominator is equal to: 


D = (1—0(1—u1)) (1 —0(1— u3)) —0uiu2 
= 1-8(1—u)) — 9(1— uz) + 67 (1 — uj) (1 — uz) — uuz 
= 1-20-0u,--0us5 — huruz + 0? (1 — u1) (1 — u2) 
= (1-60)-(1—-8)8(1— u) (1 — uz) 
= (1-0)(1—-0(1— uw) (1 — uz)) 


Monte Carlo Simulation Methods 


We finally obtain: 


e 1 (y (ur) + e(u3)) = 


The conditional copula is given by: 


C3 (u2 | u1) 


To find the inverse conditional copula C 


9, C (u1, u2) 


ug ( 


1 


U1 U2 


0 


ui) (1 


u2)) 


Qu1u» (1 = ua) 


(1—8(1—34) (1 —w))? 


(1—8(1—3) (1— 93)" 
(1— 60) us + bu? 


(1 —0 + 0u; + buz (1 — u1))? 


Co)1 (u2 | u1) = v. It follows that: 


-1 
2/1? 
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we have to solve the equation 


(1 — 0) uz + Ou2 = v (1 — 8 + Buy + uz (1 — u))? 


v (1—0 + 9u)? + 


20vu» (1 —0 + ui) (1 = ui) + 


v6^u2 (1 — uy? 


ag (v, u1) u2 + bg (v, u1) ua + co (v, u1) = 0 


v6? (1 — uy)? — 8 
20v(1—0--0u1))(1 41) — (1-6) €x 0 


Or: 
(1— 0) uo + 0u? 
We obtain: 
where: 
ag (v, U1) 
bo (v, ui) 
Co (v, u1) 


v(1—040u) 20 


We deduce that the solution is equal to: 


u2 = Vo (v, u1) = 


—bo (v, u1) — 9/03 (v, u1) — 4ae (v, u1) co (v, u1) 


2ag (v, u1) 


Finally, we obtain the following simulation algorithm: 


| 


uj = v1 


Ug = Vo (v2, 1) 


6. Using the previous algorithms, we obtain the following simulated random vectors: 


Gumbel 
0 — 1.8 


U1 


U2 


Frank 
6 =2.1 


ui 


U2 


AMH 
0 — 0.6 


ui 


U2 


0.117 
0.607 
0.168 
0.986 
0.765 


0.240 
0.478 
0.141 
0.993 
0.299 


0.117 
0.607 
0.168 
0.986 
0.765 


0.321 
0.459 
0.171 
0.951 
0.192 


0.117 
0.607 
0.168 
0.986 
0.765 


0.351 
0.452 
0.185 
0.930 
0.169 
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13.4.5 Simulation of conditional random variables 


1. Z = (X, Y) is a Gaussian random vector defined as follows: 


Cy vL st) 


We have: 


= py d XysYaa (a* — pa) 
and: 


Xu => COV (T) 
= Lyy — Eyr Dza Ery 


It follows that: 
TN (Hy + Eyr Yge (2* — us) , Eyy — Lye Uae Vay) 
Let Py be the Cholesky decomposition of Uy, — Aur gu We have: 
T = py + XysYsz (2* — uz) + PeU 


where U ~ N (0, I). We deduce the following algorithm to simulate the random vector 
T: 


(a) We simulate the vector u — (ui; Tm Un, ) of independent Gaussian random vari- 
ates N (0, 1); 


(b) We calculate Pj, the Cholesky decomposition of X, — Xyz X32 Day; 


(c) The simulation of the random vector T is given by: 
tH py + XysYu (2^ — nz) + Pau 


2. We have: 


5 
l 


J [Y — XysYxs (X — z*)] 
i [Y] — 4,Ez4 (E[X] — 2*) 
Hy — 135 EN (us — x^) 


Il 


We deduce that: 


T -E[T] = (Y - py) - Zye Ezd (X - pa) 


and: 


T 4 T 
cov (Ë) = ERY py) (Y n) | + 
Dya Tgh E [(X — pa) (X = he)" | E22- 
T 5 
[or — ty) (X - pe) | ERE 
= Xy Dye Bae Verige D = 23,3, 5 
ES My = DTE 


N 


TY TY 
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As T is a linear transformation of the Gaussian random vector Z, we obtain: 
TON (Hy ag Dysze (z* — fe) , Eyy — Mee Ug Day) 


We conclude that T = T. We deduce the following algorithm to simulate the random 
vector T: 


(a) We simulate the vector u = (u1,..., Un, ) of independent Gaussian random vari- 
ates M (0, 1); 
(b) We calculate P,, the Cholesky decomposition of ¥,,; 


(c) We simulate the random vector Z: 
Z € Hz + Pu 


(d) We set £ = (iss: 520s) and y = s rye oda): 


(e) The simulation of the random vector T' is given by: 
tey- Eys bza (x _ z") 


3. We note Z — (Zi, Zo... Zn) and Zi (21,..., Zi—1) = i | Zi = Z1,-+-,Zj-1 = 


Zi-1. Let u = (ui, U2,...,Un,) be a vector of independent Gaussian random variates 
N (0,1). To simulate Z; (z1,..., zi-1), we consider the following iteration from i = 2 
to it = nz: 


Zpo* aF oiir ame (z14—1 — Hii—1) + 


=i 
"Dy — Miyi-i3X4 a4 Vi -1 1 


with: 


zı € pu + y X111 


4. We obtain the following results: 


zı —0.562 0.437 0.427 0.404 1.984 
Z2 1.963 2.225 2.234 1.287 2.059 
z3 —0.808 4.013 7.643 —3.471 3.236 


13.4.6 Simulation of the bivariate Normal copula 


1. P is a lower triangular matrix such that we have X = PP. We know that: 


We verify that: 


We deduce that: 
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where N; and Ns are two independent standardized Gaussian random variables. Let 
nı and ng be two independent random variates, whose probability distribution is 
N (0,1). Using the Cholesky decomposition, we deduce that can simulate X in the 
following way: 


T1 S N1 
z2 + pni + y1 — pno 
2. We have 


C(X1,X2) = C(ó(Xi),0(X3)) 
C (U1, U2) 


II 


because the function ® (x) is non-decreasing. The copula of U = (U1, U2) is then the 
copula of X = (X1, X2). 


3. We deduce that we can simulate U with the following algorithm: 


uy € (21) = 9 (nı) 
| uz + ® (z2) = 6 (on, + VI- Pn) 


4. Let X; be a Gaussian random variable, which is independent from X4, and X9. Using 
the Cholesky decomposition, we know that: 


X2 = pXı + V1- p?X3 


It follows that: 


Pr{X2<o|Xi=2} = Pr{pXit VI- PX < a9] Xs =a} 


= Prl x; < ar 
vA - p? 


Then we deduce that: 
$5(zx1,223;0) = Pr{X1 < 21, X2 < x2} 


— pX 
ex < z1, X3 < sa] 


v1- p? 
X2 — pX1 


= k BU Om x) dx 
= EF E oca 


5. Using the relationships uy = (ri) u2 = (x) and ®2(a1,22;p) = 
C (9 (21) , ® (x2) ; p), we obtain: 


I 


= E BERTE 


C (u1, uo; p) 


ll 
Doe 
8 | 

© 
ie 
P ADS 
un 
RIS 
i | ee 
"sg 
N 
2 
8 
eg 
P npo" 
[en 
8 
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6. We have: 
Coj (u2 | u1) = u C (ur, u2) 
— g(t (u) 
v1- p? 


Let vı and v2 be two independent uniform random variates. The simulation algorithm 
corresponds to the following steps: 


{ uj = 1 
C» (u1, u2) = V2 
We deduce that: 


Uy — 04 
ug € ® (o9? (v1) + /1— g2671 (va) 
7. We obtain the same algorithm, because we have the following correspondence: 


c 


'The algorithm described in Question 6 is then a special case of the Cholesky algorithm 
if we take n; = 9^! (v1) and ng = 9^! (v2). Whereas n4 and ng are directly simulated 
in the Cholesky algorithm with a Gaussian random generator, they are simulated using 
the inverse transform in the conditional distribution method. 


13.4.7 Computing the capital charge for operational risk 
1. We obtain the following results: 


a E |CaR; (a)] c (CaR, (a)) ICs% (CaR; (a)) 
90% 251 660 180 0.28% 
95% 294 030 280 0.37% 
99% 414 810 885 0.84% 
99.9% 708 840 5410 2.99% 


where ICgs, (CaR, (a)) is the 95% confidence interval ratio: 


c (CoR: (a)) 


IC (GR, (a)) =2 x $^! (97.5%) x E [6i (a) 


95% 


Because this ratio is lower than 5%, we conclude that one million of simulations is 
sufficient even if a is equal to 99.9%. 


2. The results become: 


à E |CaR; (a)] c (CaR; (a)) ICs% (CaR; (a)) 
90% 183 560 128 0.27% 
95% 218 950 223 0.40% 
99% 332 870 916 1.08% 
99.9% 662 420 6 397 3.79% 


We conclude that one million of simulations is sufficient to calculate the capital-at-risk. 
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Copula ct 
— — Copula C* 


0.8 


0.6 


0.4 


0.2 


0.0 


In(S1 + S2) 


FIGURE 13.7: probability density function of In (S1 + S2) 


3. In Figure 13.7, we have represented the probability density function of In (S1 + S2) 
when the aggregate losses S; and S2 are independent (copula C+) and perfectly 
dependent (copula C*). We obtain the following capital-at-risk: 


a Ct C?  DR(C!|C7) 
90% 400240 435220 8.04% 
95% 453864 512980 11.52% 
99% 605927 747680 18.96% 

99.9% 993535 1371260 27.55% 


where DR (C+ | Ct) is the diversification ratio. 


4. In Figure 13.8, we have reported the capital-at-risk calculated with the Normal copula 
and the Gaussian approximation defined as: 


CaR(o) = 5 + 52+ y/ (Calo) - 51) + (Gio (e) - 8)" +... 


ec 2p (CaR, (a) — 51) (CaR; (a) — 52) 


5. Results are given in Figure 13.9. 
6. Results are given in Figure 13.10. 


7. For a high value of the quantile (a = 99.9%), the Gaussian approximation overes- 
timates (resp. underestimates) the capital-at-risk when the dependence function is 
the Normal (resp. t1) copula. We obtain this result, because the Student tı copula 
produces strong dependence when the correlation parameter p is equal to zero. We 
conclude that the Gaussian approximation is good in this example, except if the copula 
function highly correlates the extreme losses. 


x 105 
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a = 90.07% a = 95.07 
- 
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FIGURE 13.8: Capital-at-risk with the Normal copula 


a = 90.07% a = 95.07% 

o 44 n 5.2 
o o 
x vr x 

i 5.0 

4.2 

4.8 
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o 7.5 
o 
x 
2 
7.0 
6.5 
$— Copula 
E — Gaussian 
6.0 1 i i 1 j i i 1 i j 
0 20 40 60 80 100 20 40 60 80 100 
p (in 2) p (in 2) 


FIGURE 13.9: Capital-at-risk with the t4 copula 
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a = 90.07% a = 95.07 
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FIGURE 13.10: Capital-at-risk with the tı copula 


13.4.8 Simulating a Brownian bridge 
1. We remind that: 


z [W (s) W (t)] = min (s, t) 


We deduce that: 


W (s) 0 S S s 
W(t) | eN 0 |, t 
W (u) 0 s u 


2. We rearrange the terms of the random vector in the following way: 


W (s) 0 S s 
W (u) | e O0 l,ls ut 
W (t) 0 t t 


We note: 
B (t) = {W (0) | W (s) = ws, W (u) = wu} 


We know that the conditional distribution of W (t) given that W (s) = w, and W (u) = 
Wau is Gaussian with: 


jB()) = 0+(s o(; 2) (5)- (2) 
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var(B(t)) = t-(s D e 


and: 


| (t- s)(u— t) 
B u—s 
3. We deduce that: 
bou lic. ac (t — s) (u — t) 
u—s u—s 


where & is a standard Gaussian random variable. To simulate B (t), we then use the 
iterative algorithm based on filling the path and moving the starting point (s, B (s)) 
at each iteration. 


13.4.9 Optimal importance sampling 


1. Let X be a random variate from the distribution M (0,1). We have fuc = p(X) 
where v = 1 (X > c]. We deduce that: 


[mc] = Ely(X)] 
= / 1{x > c}ọ (x) dz 


= [oe a 
= 1- (o) 


c P 


Recall that var (fuc) = E [f2,5] — E? [fuc]. We have!: 


It follows that: 


var (mc) = p-p 
= p(1-p) 
= 9(c)(1- 9 (oc) 


We notice that pyc is a Bernoulli random variable B (6 (c)). 


2. We note Z the random variate from the distribution M (u, o°). We have: 


fis = Y (Z) £ (Z) 


lWe notice that p? (x) = 1 {x > c}. 
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where £ (Z) is the likelihood ratio: 


It follows that: ie 


s= 1 {Z > c} oeil Ey -i2 


> 


We deduce that: 


Z—p\2 


js = E[1{Z > goeiC "7:7 


and: 


ik) = Baza gore 


1/z-p 5 2 z? — 2uz + u? — 2072? 
Aus ch ee ee 
2 20? 


We note?: 


and 


2We assume that 202 — 1 > 0. 


Il 
qn E sn P CK ON 
= 
yj | 
EL 
N 
EEA 
P cS 
An x 
x 
= 
| 
NES 
Q 
N 
ed 
N 
PTR 
fz 
MES 
bo 
Q3 
N 
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It follows that: 


z [Ps] = ir E CEF) a 


We conclude that?: 


var (pis) = pone im (: (sert) (1— 9 (o)? 


'The probability distribution of frs is no longer a Bernoulli distribution. 


3. We have var (mc) = 13.48 x 1074. In Figure 13.11, we report the relationship be- 
tween u and var (frg) for different values of c. We find that the minimum value is 
approximately obtained for the same value of u: 


c u* var (pis) x 10 7 — var (frs) / var (Puc) 
0.80 3.158 0.05 0.34% 
1.00 3.154 0.06 0.45% 
2.00 3.151 0.14 1.03% 
3.00 3.151 0.22 1.6096 


Therefore, we can make the hypothesis that the optimal value of u does not highly 
depend on the parameter c. 


4. When c is equal to 1, we obtain: 
y" 2 
var (fis) = e (1- &(c 4)) - 0- 9(9) 
The IS scheme is optimal if the variance var (frs) is minimum. The first-order condition 
is then: " 
Ó var (fig) 
Ou 
We deduce that the optimal value p* satisfies the following nonlinear equation: 


2u* (1— 9 (c+ u*)) 2 o(c+ u^) 


In Figure 13.12, we draw the relationship between c and u*. We notice that: 


= 2ue^ (1 9 (e u)) - e" (c+ i) 20 


lim u^ —c 
coo 
We can then consider jj = c. In Figure 13.12, we also report the variance ratio 


var (frs) / var (mc) for the two schemes u = u* and u = c. We conclude that we 
obtain similar variance reduction with the heuristic scheme when c » 1. 


3In the case where u = 0 and o = 1, we retrieve the formula of the MC estimator: 


var (pis) = Q-9())-Q-9(9)» 
= G(c)—?(c) 
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Variance ratio (in Z) 


FIGURE 13.12: Optimal value u* with respect to c 


Chapter 14 


Stress Testing and Scenario Analysis 


14.3.1 Construction of a stress scenario with the GEV distribution 


1. We recall that: 


Xnin — b 
pm < o} zs P ec ad 
an 
= F" (ant +t by) 
and: 
G(x) = lim F” (anz + bn) 
n—oo 
(a) We have: 


mj 
3 
uem 
a 
3 
8 
+ 
c 
3 
l 
ERN, 


T= eg ^O mA Bap 
We deduce that: 


(b) We have: 


(c) We have: 


F” (anz + bn) 


0 de 
6+ 0o -1nl/e3 + 0nt/e — ;) ) 


a 
2 (1 (=a) ) 
= 


We deduce that: 
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2. 


4. 
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'The GEV distribution encompasses the three EV probability distributions. This is an 
interesting property, because we have not to choose between the three EV distribu- 
tions. We have: 

1+€ 


ss [e] Ep] 3 


We deduce that: 


u n 1+€ ui Li pb 
£ — 5 Ino” ( z bande ))- 


il 3 
MN 
=. ws 
= 
+ 
mn 
AT ORK 
B8 
a] | 
pus 
S ——Ó 
| 
ale 


. We notice that: 


'Then we obtain: 


lim G (x) 


£0 £0 


I 
i 
B 
[e] 
ta 
Ko} 
m 
l 
f= 
= 
| 
| 
InN 
PON, 
8 
Ql 
= 


(a) We have: 
G`! (a) = u — o7! [1 —(-In a) | 


When the parameter € is equal to 1, we obtain: 

G+ (a)=p-o (1 -(- Ina) !) 
By definition, we have T = (1 — a) ! n. The return period T is then associate 
to the confidence level a = 1 — n/T. We deduce that: 

R(T) -G^(1-n/t) 
- (u c (1 (—In (1 n/T)) !)) 
- (29) 
n 


We then replace u and o by their ML estimates f; and 6. 


Q 


(b) For Portfolio #1, we obtain: 
r(1Y)= (1% H (F 1) x 3%) = —34% 
For Portfolio #2, the stress scenario is equal to: 
r(1Y)2— (10% + E = i) x 2%) = —32% 


We conclude that Portfolio #1 is more risky than Portfolio #2 if we consider a 
stress scenario analysis. 
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14.3.2 Conditional expectation and linearity 


1. Using the conditional distribution theorem, we have: 


Y — ply X — [by 
——— | — px F 4/1 2 U0 
u nnn 


where U ~ N (0,1). It follows that: 


c c 
Y= @ m | Pay X + ayy 1 as 
x x 


Bo — Hy — pouty Le 
B — Pxy?y 


Oz 


o = oy4[1— pry 


We deduce that: 


2. We have: 


m(x) = E[Y| X =a] 
= E[f+8X+oU|X=al 
fo + Br +oB[U | X — a] 
= Bot Bx 


because U and X are independent. 
3. Since we have Y = o + GX + aU, we deduce that: 
23 eY 
cPo t 8X +oU 


eo XPT? 


II 


where U = e" ~ £N (0, 1). It follows that: 


ePo P ; [0] 
— epo $0? B 
because we have E [U7] = E [e""] = e$". Finally, we obtain: 


m(x) = exp c + 2 -xP 


II 
E 
"a 
P ES 
= 
e 


4. In the Gaussian case, we notice that the conditional expectation is a linear function. 
'This is not the case for the lognormal case. The use of ordinary least squares to 
compute a conditional stress scenario assumes that the distribution of risk factors are 
Gaussian. 
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14.3.83 Conditional quantile and linearity 
1. Using the conditional distribution theorem, we know that: 
F (y | X = 2) =N (yis, Lyla) 


where: 
Myla = Hy + Lyr Dza (£ — us) 


and: 


dng Hry 


qa (X) = Hylje + $^ (a) V X uva 


3 DE v 1 ly 
Yyyla = Xyy — LyeUzy > 


We deduce that: 


2. We have: 
Ga (@) — py + Es Egd (2 — pte) - 973 (a) /Dyy — Dye Ded Boy 
= fo(o)-- B'x 
where: 
Bo (a) = uy — Lys Dzs Ha +07! (a) (s = Dye bane Yay 
and: 


B = X Ys 
3. We reiterate that the conditional expectation is: 
m(x) = Bo + B'x 
where: 
Bo — py — Pye ay lis 


and: 
p= ee ag 


It follows that linear regression and quantile regression produce the same estimate 8, 
but not the same intercept. Indeed, we have: 


Bo (a) = Bo + $7 (a) us m Lye Lire Yay 


If a > 50%, the intercept of the quantile regression is larger than the intercept of the 
linear regression: 

Bo(a) > Bo Ifa » 5096 

Bo (a) = Bo If œa = 50% 

Bo (a) < bo Ifa< 50% 


We conclude that the median regression coincides with the linear regression if (X,Y) 
is Gaussian. 


4. We know that Z = $1 (F, (7)) ~ N (0,1). It follows that (X, Z) is Gaussian. The 
expression of the conditional quantile of Z is then: 


qé (£) = Bo (a) + 8' x 
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Since we have Z = $^! (1 — e-^7) and: 


In (1 — 9 (Z)) 
n 


T= 


we conclude that the conditional quantile of the default time is: 


hae In (1 — CC + B'x)) 


5. By definition, we have PD = F, (7). We deduce that: 
gq.” (x) = ® (Bo (a) + B' x) 
6. By construction, p is the correlation between Z = `! (F, (r)) and X. We note pz 


and X,, = o? the mean and variance of Z. Since, we have X,, = poxoz, we deduce 
that: 


Bo (a) = Hz m NNI + o! (a) xe 5 Sy YE 
Oz =i 2 (p00)? 
= uz—p~“ HU: +® (a) Coo 
Ox OF 
Oz =i 2 
= qa—p—pst9 (a)o;vl1—p 
Og 
and: 
9x02 
0; 
— oz 
= F. 


Because! u, = 0 and ce; = 1, we finally obtain: 
qa” (£) = ® (o(a) +8 2) 


Oz a 
= o (uot (x — pz) + 97! (a) c, =A) 


x 


o (a7 (a) viz + E) 


Ox 


II 


7. We observe that the conditional quantile of the default probability is not linear with 
respect to the risk factor X. However, we notice that 71 (qẸP (z)) is a linear function 
of X. This is why we may use the following quantile regression in order to stress the 
default probability: 

$ (PD) = bo +8'X +U 


where X is a set of risk factors. 


! Indeed, F+ (7) ~ Ujo,1] and 97! (Uto) ~M (0, 1). 
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15.4.1 Elastic net regression 


1. (a) Let f (8) be the objective function. We have: 


(01 T Ue 
f(8) = g(Y-X8) (Y-X5)« 22.5 
= S6 X" Xj -B'X'Y« FY + 3878 
= 567 (X X Alk) 8 X Y + YTY 
We deduce that: 
s = (X'X+XIk)b-X'Y 


The first order condition dg f (8) = 0 implies that: 
Bras = (XTX Amp) X" Y 
(b) We recall that 95 = (XTX) ' XTY. We deduce that: 
(XTX + AI) proe = (XTX) P5 x" v 
and: 
pride = (x'x- Am) ((X'X) ) ges 
= (XTX) xxxn) P" 
= (te + (TX) *) fem (15.1) 


(c) It follows that: 


[emm] = mine) a] 


= (IA (xTx)') B 


where f is the true value. If IE d = f, we obtain: 


(Ik +A (XTX) ) = Ix e Ir+tA(X"X) = Ix 


e A(X'X) -0 


= r=0 
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From Equation (15.1), we deduce that: 


—1 


var (Bre) = (I +2 (Cx) var (8°) (Ix +a (XTX) *) 
= e (Ik +A (xTx)') (XTX) ^ (Tx + (xTx)') 
= 0° (X™X+Xk) (Ix +\(XTX) )) B 
= e (x +A (XTX) ) (XTX +x) p 
= 09 (XTX +?( n + Xx) 
= e(X'x«Q)' 
where: 


Q = (XTX) 4 21Ik 
Since Q is a symmetric positive definite matrix, we have: 
(X'X+Q) > (X'X) 
where > is the positive definite ordering. Finally, we obtain: 
var (a) > var (orden) 
We have: 
X ree 


X(X'X4AI) X'Y 
HY 


ney 
Il 


II 


where H = X (XTX +AIx) X". We deduce that: 
grow — tr(X(X™X4 Mx) x7) 
= e(QXeAm) X'X) 
We consider the singular value decomposition X = USV where U and V are 


two orthonormal matrices, and S is a diagonal matrix that is composed of the 
singular values (51,...,5&). We have X! X = VS?V' and: 


X'X SA = VV! +AVV' 
V (S? + AIK) V" 


=! = V (92+ AIr) V and: 


It follows that (X'X TAI K) 
(XTRA X'X = VE FAL V VST 


= V(S?+XK) SVT 


We finally obtain: 
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(f) If X is an orthonormal matrix, we have X! X = Ix and: 
Bridge - (Ik 4 AF) fes 


ols 
14A 


Since we have var (Bree) = o? (Ik +Q) ! and Q = M Ig + 2AIK, we deduce 


that: 
2 


Aridge = d I 
var (re) (sore) 
1 Rols 
+a? a E var (ô ) 
Concerning the model degree of freedom, we obtain: 
K 
1 K 
df (model) = = 
2» 1+A 1-A 
(a) We have: 
1 
f) = ;(Y-X8) (Y- X8) 
X K K 
5 (odoin +(1 0) >t) 
k=1 k=1 


We note A = ( Ig —Ik ). We introduce the parameter vector 0 = (8*, 87) 
such that 8 = 8+ — 67, B* > 0 and 8^ > 0. We notice that: 


K 
S lal 
k=1 


II 
iM 
> 
ane 
l 
D 
z] 


= 1'0 
Since we have 8 = AQ, it follows that: 


1 
f(s) = 50 AT (X'X+A(1—a) Ix) A0 — 
1 Aa 
0 A'X'Y --Y' v4 — (071 
*35Y Y+ (0 1) 
'The corresponding QP program is then: 


1 
6 = argmin 50'Q0—6'R 
uc. 020 


where Q = A (X'X+A(l—a)Ix) Aand R= A'X' Y + c 
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(b) Results are given in Figure 15.1. 


a=0 a = 0.25 


COO E04,..0.6 0.8 1.0 


€——À eee A 5 me 


FIGURE 15.1: Comparison of lasso, ridge and elastic net estimates 


15.4.2  Cross-validation of the ridge linear regression 


1. The objective function is equal to: 


LEN = 5(Y-X8) (Y -X+ 2878 


Dlr NIe 


(v'v -28'X'v +! (K'X+XIx) 8) 
The first order condition 0g £ (8; A) = 0 is equivalent to: 
-X' Y+ (X'X- Aly) 6 =0 
We deduce that: 
Ê= (XTX +AIx) X'Y 
2. We have: 


ĝi = (XIXa tAk) X Y 
= (X'X- zz] +I) (XTY - ziyi) 
= (Q — zum) (x'v — LiYyi) 
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where Q = X'X + AI x. The Sherman-Morrison-Woodbury formula! leads to: 


2 z 1 
(Qazi) = Q ies 


= e (I)e 


) Q a.2)Q™ 


where h; — a) Qtr = zl (X'X + Mk) ` zi. We can now obtain a formula that 
relates the ridge estimators B..; and B. Indeed, we have: 


B.; = Q'IX'Y-Q'ziy + 


(XTX 4 XK) a; 
Lh; 


where à; = yi — a} B. 
3. We notice that: 


Ui—i = Wi — Vi,-i 


is (à (X'X+ de) 


Press = — (yi — diy 


1Suppose u and v are two vectors and A is an invertible square matrix. It follows that: 


1 


AT! TA-1 
14 vlA-iu d 


(Aw) = A7! 
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5. Let H(A) = X(X'X AL) X! be the hat matrix?. By considering the singular 
value decomposition X = USV! , we obtain: 


X'X = VSU'USV" 
Vs?v" 
and: 
X'X AL, =V (+A) V! 
It follows that: 
dgedel) (A) = traceH (A) 

= trace(X (XTX + Mx) X7) 

= trace ((x™x + AL XTX) 
Since we have: 
(S? + Mk) VT) ^ v gv 
T) ($2 + Arg) " v 1vs?vT 
V (S? E Arg) ^ S?v 


(X"X4-AL) X'X = (V 
(V 


we finally obtain: 


agate) (A) = trace ((V7) ^ (S + Mx)! S?V7) 


( 
=. fgg ((s? 3 ih S2yT gu 
( 


S? + Mx) S?) 


We verify the properties df "4€? (0) = K and qf("ed*? (o5) = 0. 


6. Since we have Y = H(A) Y and U = (J, —H(A)) Y, we can express the PRESS 
statistic as: 


whereas the generalized cross-validation statistic is defined by: 


GCV = 53 (& EDI 


i=l 


where h = n~! 357, H(A), ;. We verify that the generalized cross-validation statistic 
corresponds to the PRESS statistic where the elements H (A); ; are replaced by their 


?H transforms Y into Y (pronounced “y-hat”). 
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mean h. We have: 


ETE. m 


T^ 


= toos (In - H (4)) 
n 

o 1 3 n 82 

© n K st +X 


u issue c 
EM K (s2 + X) 


k=1 


1 254 
GCV = : 
n 1 K (n—K) sf +n 
oS ees! 
K —2 
s ^ (n — K) E +n ^ 
— 5 (>: EXE S RSS (809) (15.2) 


The effect of A on the GCV statistic is not obvious since we have: 
AZ => RSS (809) E 


and: 
K 


(n — K) s2 +nà 
A — ss 
a 2 s +À 7 
7. Since we have 1 — h = n^! trace (I, — H (A)), the GCV statistic is equal to: 


e I > (sao) ny (809) 


= n(trace (I, — H (A)))? RSS (ô (9) 


From the Woodbury formula, we have?: 


In-H() = L-X(X'X-Alk) X! 


xx! \ 
I, NL TR LO 


3'The Woodbury matrix identity is: 


(A+ BCD)! = A! — A3 B(C? + DA!B) DAT! 


292 Handbook of Financial Risk Management 


Let A; be the eigenvalues of the (n x n) symmetric real matrix XX '. We have: 


trace (In —H(A)) = J GIN 


because the last n — K eigenvalues A; are equal to 0. Moreover, we have Ay, = s?. 
Finally, we obtain*: 


KY 5d : 
GCV =n (ah) RSS (809) (15.3) 


8. The values of Bg when A is equal to 3 are reported in Table 15.1. In the last row, 
we have also given the ridge estimate B calculated with the full sample. Using the 
values of i—i, à; ;, à; and h; (see Table 15.2), we obtain? Press = 0.29104 and 
GCV = 0.28059. 


15.4.3 K-means and the Lloyd's algorithm 


1. We have: 
K 
lei- gl? = XO (tik — 242) 

k=1 
K K K 

= 2 2 

= X, Ti k t E — 2; Zi kj k 
k=1 k=1 k=1 

and: 


K K K 
~12 - i 
k=1 k=1 k=1 


4We verify that Equations (15.2) and (15.3) are equivalent, because we have: 


K 
ibs ei) 1 wn (n- K) (sd +A) + KA 
K 82 +A K S2 +A 
k= k=1 
“A 
= K)4 
m ) 2 x 


5We have h = 0.24951, s1 = 58.71914, s2 = 51.42980, s3 = 45.83216, s4 = 37.91501 and s5 = 26.61791. 
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TABLE 15.1: LOOCV ridge estimates P 


£1,-i £5, —i b3,—i Ba, i B5.—.i 

1.2274 —0.9805 0.1298 4.0.4923 0.0398 
1.2307 —0.9865 0.1357 —0.4946 0.0415 
1.2835 —0.9827 0.1362 —0.4925 0.0410 
1.2808 —0.9876 0.1355 —0.4957 0.0417 
1.2296 —0.9849 0.1358 —0.4948 0.0420 
1.2300 —0.9851 0.1361 —0.4941 0.0422 
1.2335 —0.9870 0.1287  —0.4898 0.0476 
1.2219 —0.9838 0.1357 4 —0.5047 0.0463 
1.2281 —0.9844 0.1382 —0.5005 0.0445 
1.2319 —0.9889 0.1401 —0.4912 0.0444 
1.2299 —0.9856 0.1353 —0.4938 0.0430 
1.2300 —0.9849 0.1355 4.0.4950 0.0411 
1.2280 —0.9817 0.1320 —0.4974 0.0407 
14 | 1.2307 —0.9855 0.1365 —0.4965 0.0427 
15 | 1.2314 —0.9839 0.1360 —0.4937 0.0426 
16 | 1.2285 —0.9861 0.1390 —0.4944 0.0393 
17 | 1.2289 —0.9843 0.1346 —0.4958 0.0390 
18 | 1.2246 —0.9855 0.1370 —0.4892 0.0426 
19 | 1.2267 —0.9878 0.1356 —0.4920 0.0443 
20 | 1.2459 —0.9890 0.1386 —0.4830 0.0358 


B | 1.2301 —0.9854 0.1358 —0.4941 0.0420 


oa 3 
WNHrFOWOONDWOTHRWNYEH|™ 


TABLE 15.2: Computation of i,t Q; i, [m and hi 


>. 


Yi Îi, —i Q; —i à; hi 


—23.0 —22.3270 —0.6730 —0.5130 0.2378 
—21.0 —21.2041 0.2041 0.1796 0.1201 
—5.0  —5.4804 0.4804 0.3950 0.1778 
—39.6 —39.7745 0.1745 0.0857 0.5091 
5.8 5.7076 0.0924 0.0828 0.1040 
13.6 13.5376 0.0624 0.0525 0.1582 
14.0 14.7404  —0.7404 —0.4168 0.4371 
—5.2 —4.3994 —0.8006 —0.5534 0.3087 
9 6.9 7.0607 —0.6607  —0.5306 0.1970 
10 | —5.2  —5.6244 0.4244 0.3106 0.2681 
11 0.0 | —0.0913 0.0913 0.0595 0.3483 
12 3.0 3.2119 —0.2119 —0.1974 0.0682 
13 9.2 8.9014 0.2986 0.1664 0.4428 
14 26.1 26.3478  —0.2478 —0.1842 0.2568 
15) -6.3  —0.4835 0.1835 0.1192 0.3506 
16 11.5 10.9309 0.5691 0.4763 0.1631 
17 4.8 4.3120 0.4880 0.4360 0.1065 
18 35.2 34.4379 0.7621 0.5531 0.2742 
19 14.0 13.2528 0.7472 0.6633 0.1123 
20 | —21.4 —22.5438 1.1438 0.7438 0.3497 


ANAK WNH 
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where zk) = n lY tie. We note $1 = iy Xx lx; — zl? ids 
n x lti — z|^. We deduce that: 


Be XX Qeon ena] 
4—1 j—1 
K n 


= 3 t,t 32k — 2i kj) 
i=l j—1 
Ky n m m 7 
= 275 nM zin xl,—2M zu 3 tjk 
E i=1 j=1 i=1 j=1 
n n 
Q> Tik T 5 sunw) 
i=1 i=1 


OX) 
i=l 


n K n K n K 
$9 = nM Mosi un Tik) —2n M M tikt) 
1 


and: 


i—1 k=1 i=1 k= i=1 k=1 
K n n n 
= 5 Q> 2k + n3 El) — 2nZ (x) > 2 
k= i=1 i=l i=l 
K n 
= Xo Q> 27 + n^ Eu — est 
k= i=1 
K n 
= 5 (^x 2 — 23 
k= i=l 


We conclude that $1 = So. 
2. Using the previous result, we have: 
5 2 5 |z; — zy I? = Nj > æ: — 2; ||? 
2 o (i)2j Ci )=j G)-3 
where x; and nj is the mean vector and the number of observations of Cluster Cj. 


3. The first-order conditions are: 


Of (Iss --, Hno) 
Op; 


=0 for j = 1l,...,nc 


where: 
no 
2 
Fn TP = sony 5 EZ m ul 
j-1  C(i)oj 
Since we have: 


a 
—— |e; — ps |[" = —2 (xi — uj 
gag le - nil = 26 - n) 
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it follows that: 


ao =—2n; M («i — uj) =0 
Hj C(i)—j 


We deduce that u} = (uj 5, . +, H3 y) where: 


Finally, we verify that the optimal solution is u} = zj. The Llyod’s algorithm exploits 
this result in order to find the optimal partition. 


4. In Figure 15.2, we have reported the classification operated by K-means, LDA and 
QDA. We notice that the unsupervised K-means algorithm gives the same result as 
the supervised LDA algorithm. 


Observation QDA prediction 
7 7 
6 e - 6 e js 
e 9 
5 LA > 5 $ : 
~ le 00° uu ~ le oe nn 
3 u 3 L| 
x e. mA x te @ A 
2 " aa aA, A 2 @ à a ^ 
1 a, > a, A 
: AA 0 AA 
0 1 2 3 4 5 6 0 1 2 3 4 5 6 
X 6 Class C, X 
e WClass C; "me 
LDA prediction AClass C3 K—means prediction 
7 7 
6 [) 9 6 [] 9 
o 1$ "a 1$ 
4 L| 4 LI 
DEKILÀ e? [C] DEKILÀ e? [C] 
"4 D Q "a D Q 
2 @ 4 pä 2 (OWN "E 
1 a, " 1 a, a 
= ar 
0 1 2 3 4 5 6 0 1 2 3 4 5 6 
X X 


FIGURE 15.2: Comparison of LDA, QDA and K-means classification 


15.4.4 Derivation of the principal component analysis? 


1. We have: 


var(Zi) = var (BX) 
BL X 


®The following exercise is taken from Chapters 1 and 2 of Jolliffe (2002). 
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The objective function is to maximize the variance of Z; under a normalization con- 
straint”: 
fi = argmaxf X8 
st B'B=1 
'The Lagrange function is then equal to: 
£(8;X) = B'Yg -  (8' 8 — 1) 


Since, the first derivative 0g C (8; A1) is equal to 246 — 2418, we deduce that the 
first-order condition is: 
XBi- AX (15.4) 


or: 
(X — Ak) i = 0 

It follows that (1 is an eigenvector of X and A, is the associated eigenvalue. Moreover, 

we have: 


var (Z1) 81 X81 


Ay 


I 


Maximizing var (Zi) is then equivalent to consider the eigenvector £1 that corresponds 
to the largest eigenvalue. 


. We have: 
var(Zag) = var (83 X) 
= fX» 
Using Equation (15.4), we deduce that the covariance is: 
cov(Zi, Z2) = fi Xf» 
= ify Be (15.5) 


The objective function is then to maximize the variance of Zo under the constraints 
of normalization and independence between Z4, and Z»: 


fo =  argmaxf X6 
8 B-1 
s.t. { B78 =0 


The Lagrange function has the following expression: 
L (P; A2, P) = BT XB — A (B'8—1) — pbi L 
We deduce that the first-order condition is: 
23/85 — 2A282 — «1 = 0 


It follows that: 
£1 (2082 — 22585 — ofi) = 810 = 0 


T Joliffe (2002) notices that the solution is 81 = oo without this normalization. 
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or: 
261 X8) — 22281 b2 — vB! B1 = 0 


Since the 1** and 2?* PCs are uncorrelated, Equation (15.5) implies that 9j U2 = 0 
and 8j 82 = 0. We deduce that —y 8, = 0 or y = 0 because fj f, = 1. In this case, 
the first-order condition becomes: 


X85 = A202 


Again, f$ is an eigenvector of 3 and As is the associated eigenvalue. Maximizing 
the variance of the second PC is then equivalent to consider the eigenvector 65 that 
corresponds to the second largest eigenvalue?. 


15.4.5 Two-class separation maximization 


1. The total scatter matrix is equal to: 


TL n 
= Meal -AY e] + maa 
J= t=1 


T 
"T 
= X sia] -nih 
i—1 


For the within-class scatter matrix, we obtain: 


Sw = MS 


ja 
J 
^ AC. 
E 5 (xi — 5) (x; — By) 
j=1iec; 
J 
= ` 5 Tiv — NG MG MG 
j=1 \iec; 
J J 
^ ^ ^ A A 
= > Tit; — NjhHjhj 
j-1i€C; j=l 
n J 
^ T ^ era 
= Tili — Nj Mj Hh; 
i=1 j=l 


8We cannot use the first eigenvector because el B2 must be equal to zero. 
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Concerning the between-class scatter matrix, we deduce that: 


Sp = Yon(üj-i)(à;—-R) 


j=l 
J J 
= M nh; 28 nj] + nga" 
j=l j=l 


= on — 2A (mA) + nat 


because nfi = a nj [t;. It follows that: 


n J J 
Sw +S5g = mial -X nfd] | + |Y nhh — nip 
i=1 j=1 j=l 
n 
= J niz — np" 
i=1 
= § 
2. We have: " z 
A Nf T N2442 
E EE 
ny + "19 
It follows that: 
2 4 niĝı maf nifi naf 
HTH = 
nı T ne ny T ng 
0 mif d naíty — ni flr — naf 
Tli + Ti2 
n2 A A 
zü ae (fia — fiz) 
and: gj 
fiz — = (jz — fir) 
ny + ne 
We deduce that: 
» rS SOT. " ANA XX 
Sg = m (fi — f)(f1— B) +12 (Ae — Â) (Ae — A) 
2 
no " NES LAT 
= m ( ) (âi — fi2) (Aa — fia) + 
ny T Ng 


2 

ni A ^ ^" ^ 

na ( ) (fiz — fir) (fio — fir) 
NI T N2 

i MIN2 ^, n ~ RT 

p (f£ — fi2) (A — faz) 
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and?: 


B'Ssp = 


where íi; is defined as: 


3. We have: 


and: 
8'$,8 = B'S” (zi fy) (ei — fy)’ B 
iECj 
= YN(B8'-B8'à) 


icC; 


We deduce that: 


8' (S1 + $3) B 


B'S18 + B'S3B 


z2,2 
= sits? 


D 

4 

n 
x 

X» 
| 


II 


4. We have: 
B' SB 
B'SwB 
a D. 
T4705 (fi = ji) 
ni-Fm»o 81 + 82 


J(8B) = 


We finally obtain the following optimization program: 


g* — arg max (fa = ñ) 

52 + 82 
In order to separate the class Cı and Co, we would like that (ñi — jin)? is the largest, 
meaning that the projected means must be as far away as possible. At the same time, 
we would like that the scatters 5? and 82 are the smallest, meaning that the samples 
of each class are close to the corresponding projected mean. 


because Bl i — B T fo is a scalar. 
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5. At the optimum, we know that: 


SpB = ASwB 
We have: 
nyn » » Be » 
Sg = 1? (fia — fia) (i1 — fia)" B 
ny + ng 
nanı » p y " 
a — (li — fiz) (8 a E B" fiz) 
ny + ng 
= (fi — fiz) 
where: 
| 1T,n2 TA TA 
a (8' ài — B' fiz) 


We deduce that: 

ASw B = 7 (fis — ĝ2) 
or: 

8 = {SW (n — fia) 


It follows that the decision boundary is linear and depends on the direction fij — fiz. 
Since we have J (8^) = J (B) if 8' = cB, we can choose the following optimal value: 


* = Sy) (fis — fi) 


. We have: 
Sw = S+S 
u 16.86 8.00 E 21.33 18.33 
B 8.00 18.00 18.33 18.83 
= 38.19 26.33 
g 26.33 36.83 
and: 


sg, ( 089 -310 
BSN md 10.86 


The equation Spf = ASw is equivalent to Sw Sn — AB. 'The largest eigen- 
value of the matrix S5 S B is equal to 0.856, and the associated eigenvector is 
8* = (0.755, —0.936). We deduce that the scores of the 13 observations are —1.117, 

2.990, —3.352, —0.544, 1.147, —1.843, —0.907, 0.755, 0.573, 2.083, 0.392, 0.784, and 
—0.152. We have gj = —1.372 and gj, = 0.739. It follows that ~ = —0.317. The 


assignment decision is then: 


s; < —0.317 > i € C 
si > —0.317 > i E€ Co 


We observe that the 5*à observation is incorrectly assigned to Class Co, because its 
score 1.147 is larger than —0.317. 
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15.4.6 Maximum likelihood estimation of the probit model 
1. The probit model is defined by: 
p=Pr{Y =1 | Xo =o (x' 8) 


We deduce that the log-likelihood function is equal to: 


eo) = Enry =u) 
= Yin (0 - pi) pt) 


= Mü-y)hn(1-p)-wnpi 


= X (1-u)ln (1-8 (z] 8)) + yind (z/ 8) 


i= 


2. We have: 
Ok; 
(o) oe) Lik $ (zl B) xix 
= 1 i i 
EET TOT B TOT 


We deduce that: 


-(1- i d i can 
(yi & (era) ó (515 : 
® (x7 8) (1-8 (erp) 
It follows that the score vector is equal to: 
yo (9n 7 9 (z; B)) 9 (z; B) 
SWESS Gat 


Vo 
na 
— 

l 
e 
Lum 
8 

Seal 

Vo 
wa 
wn 


i=1 


() = gg G1) - e GT9) 
(a 


i P) vi; = 9 (z; B)) — 9 (zi B) $ (2; B) vi 
x a 28 (x 8)) 6 (v; B) vi; 


() = gy (w- 8 (278) 6078) 


= (wi - 9 (zi B)) 6 (zi B) (zi B) tij = ^ (zi B) tij 
= (-ywm B+ © (rj B) x) 8 — > (aj B)) o (x B) xi; 
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We note: j i 
a EL) (= 8 (a7 p)) 8 &(8) 
Ga ó(zl8)zi;mi. O8x0B; 
= (278) (1 - & (274)? 
$(z;B) (1—O0(z; B 
B; = , 
6 (x1 8) s 
We obtain: 


Ai = (-yiml b +(x] B) x] B — o (v1 B)) ® (x; 8) (1— 9 (27 8)) — 

(y; — 9 (xj 8)) (1 — 29 (x; 8)) 6 (v; 2) 

= -y: (x! B) x] B+? (x18) x] B — (x1 8) d(x) 8) + 
y? (a B) x] 8 — 9? (a) B) x] B+ &? (a! B) 6 (x1 8) — 
yio (x1 B) + 9 (x1 8) d(x] 8) + 24: (x7 8) 6 (27 8) - 
20? (x] 8) 6 (a; 8) 

= S? (xj 8) (—2] B) + 9? (x16) (A w)2] 8 — 6 (xl 8)) + 
& (z] 8) yi (7j B + 26 (21 B)) — yd (l B) 


and: 


Bi = w(é(zi 8) t zi 89 (a; 8)) (1— 9 («18)) + 

(1 — yi) e (ef B) - «1 8(1— 9 (27 8) ® (7 8)” 

= yid (xj 8) + yi (x; 8) (vj B) — 2y:® (aj B) 6 (a; 8) — 
2y9? (x; B) (xj B) + vi? (v B) 6 (z; B) + 
yi9? (xj 8) (xj B) + © (x; 8) 6 (zi 8) — 
yi9? (v 8) 6 (xj B) — €? (x; B) (xi 8) + 
$? (x; B) (xj B) + vi? (x; B) (xj 8) — vi? (vj B) (vj 8) 

= (zi 8) (zi 8) + 9? (zi B) (~ (1+ yi) si B+ (z; B)) + 
© (zj B) vi (z; B — 26 (21 B)) + wi (z; B) 

Since B; = — Aj, we deduce that: 


n) = y 2450 


i=l 080 BT 
E TED E 
2 iG i-oaiay C7) 
MEC 4 (078) " 
- CàAPRuyü-egny 7) 
= -9 H (ax!) 


4. The Newton-Raphson algorithm becomes: 
Bux) = bis) — H^ (809) S (Boe) 
where 65) is the value of 8 at the step s. We may initialize the algorithm with the 
OLS solution f = (X' X) XTY. 
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15.4.7 Computation of feed-forward neural networks 


1. We have win = op hkvik or U = XB" where U is a n x n, matrix, X is a 
n xn, matrix and B is a n; x n, matrix. Then, we apply the non-linear transform 
Zih = fa, (Uin) or we perform the element-by-element function Z = fe, (U) where 
Z isan xn, matrix. The calculation of vi j = rou "Jj, Zi,n is equivalent to compute 
V = Z4! where V is a n x ny matrix and y is a ny x n; matrix. Finally, we have 
yj (£i) = fey (vij) or Y- fey (V) where Y is a n x ny matrix. It follows that: 


^ 


Y = fey (V) 
jy (z4') 
= fey (ies (U) 47) 
= fey (ae (XB") y!) 
2. If fo. z(z) = fy (2) = z, we deduce that: 
Y=Xp'y'=XA 
where A = (8) * is a n, x n, matrix. The least squares loss is then equal to: 


n Ny 


L(0 = $2.55) 


n Ny 


= 5> ; (y; (ws) — yii) 


i=1 j=1 


- 5 race ((9 — )' (f -v)) 


Z 5 trace ((XA —Y)' (XA - Y)) 


We deduce that: 


D 
l 


= argmin ; trace((X A — Y)T (XA — Y)) 
= (X'X)'x'Y 


and: 
4B—A'-Y'X(X'X) 


We conclude that it is not possible to estimate 8 and y separately because it depends 
on the rank of the different matrices. Indeed, A has n; x n, parameters whereas the 
product y6 has n, (nz +ny) parameters. In particular, the model is overidentified 


when: 
Ng X Ny 


Nz 
Ne + Ny 


Otherwise, we obtain a constrained linear regression: 
5 1 
(8.4) E arg min 5 trace ((XA — Y)' (XA — Y)) 


304 Handbook of Financial Risk Management 


3. Using chain rule, we have: 


Ə Lij (0)  O€£(yp (xi) wp) Oyy (xı) 8 viy 


OYj,h yj (xi) vij’ Ó jn 


We notice that: 


Oh 


Ovi j E Zh if j = j 
0 otherwise 


It follows that: 


0 Li; (0) 
E =€ (yj (x;) 55) fy (vi,4) Zih 
and: 
9 Ci, (0) 
codo INFR iy 
OR 
We also have: 
O£,;(0) _ 3E (Ys ri) , yig) Oy; (vi) O Vij Ozin Ouin 
O Bhk OY; (xi) Ovi j Ó Zi.n Ouin O By k 


€ (y (xi) Vos) fey Wig) vio fis (Uih) Tik 


I 


Finally, we deduce that: 
0 £ (0) 


T 
Oy = (Gyy © Gyw) Z 


and: 


8 £ (0) 
0 


where 0,£(@) is a ny x n; matrix, Gy = £' ee Y) is a nx ny matrix, Gy. = f} y (V) 


= (((Gy,y © Gyw) 7)© Gau) X 


is a n x ny matrix, Og£ (0) is a n; x n, matrix and Gz u = f; z (U) is a nx n; matrix. 


4. We have: 


and: 


(1+ e77)? 

1 1 
= 1+e-7 (1 ra] 
f(z) - f(2) 


It follows that foy (vij) — Sey (vi,j) (1 = Jg (vi,;)) = Yj (xi) (1 = Yj (2)) and 
foz (Uin) = fe (uin) (1 — faz (uin)) = zin (1 — 2i). We also have: 


Il 


and: 


We deduce that: 


de = (yz (xi) — vo) uj (xi) (1 — yz (21)) zin 
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and: 
O £i; (0 
25:20) — (y, (23) = ws) ui (0) 1 — w 0) Hazen (1 = a na 


The matrix forms are: 


o p 
where G (f) =Yo (a — Y ) and G(Z) = ZO (nxn, — Z). 


. We have: 
E (y) 7 — (ylnd + (1— y)In (4 — 9)) 
and: 
T EN (l-y) 
Gy) = $^ 0-9) 
80-990 -$) 
$0 -— 1$) 
af sg 
g(1—9) 
We deduce that: 
0 Li (0) (y (zi) — yi) 
= (y(xi) — yi) Zi,h 
and: 
OL; (0) (y (zi) — yi) E ee 3 agros as 
8 Bnk x y (xj) ü =e (23)* (2) (1 y( 13) ^Yh Zi,h (1 ih) i,k 


= (y (ai) — ys) YnZi,n (1 — Zi h) Zi,k 


The matrix forms are: 


and: 
OL (0) 


: T 
—— -(((f-v)yezoa-2) x 

8B ((( VO ZOE) 
. In the case of the softmax activation function, the value of y; (x;) is equal to: 


yj (ti) = fay (vij) 


Evid 
ger 
This implies that y; (x;) depends on y; (z;): 
< G evs aye ei 
2.4 =) 5 y T we wy 
je np pu cl EM 


The loss function is additive with respect to the index i, but not with respect to the 
index j. 
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7. We have: 


It follows that: 


We also have: 


We deduce that: 


and: 


We notice that: 


(*) 
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Ə f(z) x Dees e^! — e? 
de Oe). 
Zj nc Zu Zj 
UE ELEM =e 
CORP 
e?3 e 
^ Ne ( x. =) 
= f(G)ü-f() 
O f (25) E eti 
ô zj no 


Zan 
=] evs 


ew") 
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ei ei! 
= Rm eži" DAS eži" 
= —f (25) f (z7) 


5 0€ (yj (zi) xin) Q yj (24) 
"iz Oy; (xi) Ovi j 
Yi j OY; (24) i Vi,j' 3 yj (24) 
Yj (zi) Ovi j br Yj’ (xi) Ovi j 
Yi j Yi, j’ 
x Vij T= Ui j)) — Vi j Ui 5! 
Ui,j Vi, 
x: zi) (1 — yj (i j (vi) uy (Li 
-yi (1 — yy G0) + ys (a) 9 viy 
jzj 
nc 
—yij + Yj (21) 5 Yi,j' 
j=l 


yj (zi) — vij 
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because’? 3777 Yig = 1. We deduce that: 


OL; (0) a by 9€ (yy (x An p ð yj" ( Ti ) 9 vi jr 


OYj,h jt PR xi) O vi, jn OYyjh 
" Y OE (yy (vi) ui, j/) oui s. 
£0 byy (xi) Ov 7 


E 5 E (yj (xi), Yi j) Oyy (x) 


Zin 
jal O yj (xi) Ovi j 
FE (yj (xi) — Yi, j) erg 
and: 
OLO _ SS AE (yy Cr) ma) SS Ouy Gr Dn Dzin ia 
O Bh,k a= O yy (zi) p= Ovijv Ozin OUi,n O bh,k 
< OE (yj ( zi) s Oy; ( zi) 
= 29 Dy; ( zi) > LIV Vi" ,hTi,k 
j'-1 


«A [XS OE (yy (wa) vis) I yy (vi 
LSS € (yy (21) yi) Oyy (21) 


saen OMe) O vij” La 
nc 
= 5 (yj (2i) — Yi jr) Yj”,h | Ti,k 
gat 
nc 
L 5 (yj (zi) — Vi.) Vj,h | i,k 
j=l 


The matrix forms are then: 


and: 


OL (0) ( ( ^ 1 

=((¥-¥)7) x 

8B T 

8. We have Og, o Ui h 2 Onn, o Vij — ] and PPM Vi j = vii. We note Bo the vector of 
dimension n; x 1, yo the vector of dimension n, x 1 and yz the matrix of dimension 
ny X Ng. If there is a constant between the z's and the z's or between the z's and the 
y's, we have: 


O0 £ (0 
(o) = (((Gy,y © Gy) ) © Gi.) lnxi 
ð Bo 
and: à £ (8) 
Ó ^o = (Gy.y © Gy). laxi 
In the case of direct links between the z's and the y’s, we obtain: 
OL (0) 


i 
an = (Gy, © Gyw) 


10 All values Yi,j/ are equal to zero except one value that is equal to one. 
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15.4.8 Primal and dual problems of support vector machines 
Hard margin classification 


1. We note 0 = (8o, B) the (1+ K) x 1 vector of parameters. The objective function is 
equal to: 


1 
£(0 = gll 
b T 
59 Q0—0'R 
0 Of 
9m ( o.) 


and R = 0,, y. The constraints y; (Bo + z] B) > 1 are equivalent to y;ßo + yz; B > 1. 
The matrix form C0 > D is defined by: 


where: 


Yı JVyiYXii co^. VuvikK 
d ME 
Un Unn,1 Uie Unn,K 


and D — 1,. 


2. The associated Lagrange function is: 


L (Bo, 8; o.) 


1 
2 elles — a! (ypo +yO XB- 1n) 
1 
= Ug ll8lI5 — (aly) bo- a! (yO X8) - a' 1, 


= T — Bo D: n) — pt (>: ons toi 
i=l gl i=l 


where o > Oy is the vector of Lagrange multipliers. The first-order conditions are: 


AL(Bo, B30) «x 


and: 


ƏL (Bo, f; : 
= a) - 3 oiv = 0g 


i=1 
3. At the optimum, we deduce that 8 = pom o4yi;r; and the objective function of the 
dual problem is: 


TL T n n 
L*(a) = : bs wn (>: ws — Bo (>: ou] — 
PR s E n i=l 
(>: sn is wn + 5 Qi 
2m i i=l T . i—1 
= 5 Qi — ; (>: ama) (>: awa) 
i=1 i=1 i=1 


n 


n n 
= Ja- j Y senastzs 
jaN 


i=1 j=1 
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Then, we have: 


D 


n n n 
1 T 
= argmax 5 'ai E 32,2, 00V; Tj 
i=} 4—1 j—1 
st. 020, 
because the Lagrange multipliers a; are positive. 


. It follows that: 


1 n n n 
â = arg min 5 1 1 o;05yiyjt, Lj = 1 Qi 
i=1 


i=1 j=1 
Since we have X; ] aj =a! L, and: 
n n 
>D » ojo 5yiyjt, vj =a'Ta 
i=1 j=l 
where T; j = y;yjv; xj, we deduce that: 
1 
â =  argmin j^ Ta -a'in 
s.t. a>O, 
. We notice that: 
C02 D & -C90 < -D 
By applying the direct computation, we deduce that: 
1 
â = argmin 50 Qa aol R* 
s.t. a>O, 
where: 
Q* = Coc 


= c( 0 0; Jed 


Yı Viii coco yQvLnkK s 
= ; . . ( ~ OK 
I A ; Ox Ik 

Un Ynn, ^'' Yntn,K 

m 

Yı YıTıiı co YK 

Yn Ynn, ^'^ Yntn,K 
= oo (y y) +r 

where T; j = Viyj ti 2; and: 
R = -CQ'R«-D 
= -—0Q!0-1, 
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Finally, we obtain: 
1 
á = argmin so (co (y! y) + T) o — al 1, 
s.t. «20, 


'The singularity of the matrix Q does not allow to define a proper dual problem. In 
particular, we observe a scaling issue of the Lagrange coefficients. This is why we 
reintroduce the constraint Pp 24 049i = 0, and replace the previous dual problem by: 


1 
a =  argmin z2 Ta ds 


Wigs tas 
s.t. gag 
a > On 


Soft margin classification with binary hinge loss 


1. 


We note 0 = (Bo, 8, E) the (1 + K +n) x1 vector of parameters. The objective function 
is equal to: 


sO = RCE E 


i=1 


1 
= a -8'R 
where: = z 
0 Ok 0, 
Q—| Ox Ix OKxn 
On Onx K Onxn 
and: 


= Ori 
n= ( 085, ) 


'The constraints y; (Bo + xi B) > 1 — £; are equivalent to y; 89 + yix; p + £i > 1. The 
matrix form C0 > D is defined by: 


yuodqaiXua coc wXXik d 0 


Un Yntn1 ^'' Yntn,K 0 1 


and D = 1,. The bounds £; > 0 can be written as 0 > 0^ where: 


o —oo: lik 
d | 


2. The associated Lagrange function is: 


£(fo.8,6a,) = FIAS E- 


i=1 


RT yi (Bo +x; B) — 1+ &) -Zr 


i—l 
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where a; > 0 and A; > 0. The first-order conditions for £y and 8 are the same: 


OL (Bo, Bia) _ aud) -0 
i=1 


ð Bo 
and: 
oC Cres) a ee 
aB = B 3 OUyidi = Ox 
The first-order condition for £ is: 
OL (Bo, b; a) 
JE =C-1,-a-rA=0, 


It follows that: 


L*(a,A) = L*(a)t+ OD = 3 aiki ES 5 Aiki 
i=1 i=l i=l 


= £'(o)* 5 &(C a4 X) 
i=1 
= ££ (a) 
because of the first-order condition for £;. T'he objective function of the dual problem 
is then the same as previously, and does not depend on the Lagrange multipliers A. 


However, A; > 0 and C — a; — A; = 0 implies that C — a; > 0 or a; € C. Finally, we 
obtain the following dual problem: 


1 
â = argmin 5a'Ta—a" 1n 
Ta > 
s.t. pam 
0,<a<C-l, 


. Previously, the support vectors correspond to observations such that a; 4 0. Here, 
the support vectors must also verify that £; = 0, implying that A; Z 0 or a; 4 C. 
Therefore, support vectors corresponds to training points such that 0 < o; < C. 


. The Kuhn-Tucker conditions are min (Aj, €&;) = 0. We also have A; = C — oj. If a; = C, 
then A; = 0 and é; > 0. Otherwise, we have A; > 0 and £; = 0 in the case where a; < C. 
The two constraints y; (80 + SES > 1-— £j and €; > 0 implies that: 


& 21— yi (bo +z} 8) 


At the optimum, we deduce that: 
£j = max (0,1 — Vi (4 +27 8)) 


. In Figure 15.3, we have represented the optimal values of £o, £1, (a, bm £; and the 
margin M with respect to C. We notice that the paths are not smooth. We verify that 
the soft margin classifier tends to the hard margin classifier when C — oo. In Figure 
15.4, we show the optimal hyperplane when C — 0.07. We verify that the soft margin 
classifier has a larger margin than the hard margin classifier. 
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Ié and M2 


pomp 


— -— Hard margin 


Soft margin 


FIGURE 15.4: The hard margin classifier when C = 0.07 
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Soft margin classification with squared hinge loss 


1. We note 0 = (bo, 8,£) the (1 + K + n) x 1 vector of parameters. The objective function 


is equal to: 
1 TL 
FO = glBl;-C»,8 
i=l 
1 
= 3? Q0 -0'R 
where: 
0 OK 0/ 
Q—| Ok | Ik OKxn 


On OnxKk 2C.lL, 


and R = 01, 44. The inequality and bound constraints are the same as the ones we 
have found for the binary hinge loss. 


2. The associated Lagrange function is: 


Llf bga) = je CY E - 


i=l 


a (vi (Bo + x 8) -1+&) ee 
i=1 


i=l 


where a; > 0 and A; > 0. The first-order conditions for £y and f are: 


DAC cs as 
i—1 


ô Bo 
and: 
AL (Bo. B: o) - 
= iyiti =O 
0B B 2. Orid K 
The first-order condition for € is: 
OL (Bo, p; a) 
EE T Sus —a—-=0, 
DE 2-C&E-a-X 


The Kuhn-Tucker conditions are min (A;,£;) = 0, implying that A;£; = 0. This is 
equivalent to impose that 2C - £ — a = 0, or £j; = ac It follows that: 


Qi 2 Qi 
CE — af — Aib — c (55) -zg 0:6 
o? oa? 
~ 40 2C 
2 02910 
4C 


and: 


Cla = £ a-p 
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Finally, we obtain the following dual problem: 


1 1 
@ =  argmin 3* (r + sol) a= ly 
T 
: y a—0 
s.t. { a > On 


This is a hard margin dual problem with a ridge regularization of the I matrix. 


. In Figure 15.5, we have represented the optimal values of fo, £1, 82, p £; and the 


margin M with respect to C. We notice that the paths are smooth. We verify that 
the soft margin classifier tends to the hard margin classifier when C — oo. 


By 


FIGURE 15.5: Convergence of soft margin classification with squared hinge loss 


4. We obtain Êo = 0.853, 8; = —0.371 and f, = 0.226. The optimal values of a; and £; 


are given in Table 15.3. 


TABLE 15.3: Soft margin classification with squared hinge loss (C = 1) 
i 1 2 3 4 5 6 7 8 


âi | 0.00 0.00 1.16 0.00 0.00 0.00 0.22 2.48 3.13 
£; | 0.00 0.00 0.58 0.00 0.00 0.00 0.11 1.24 1.56 


Soft margin classification with ramp loss 


1. We have represented the four loss functions in Figure 15.6. The 0-1 loss function is not 


convex. The binary hinge loss function is convex, but not always differentiable. The 
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squared hinge loss function is convex and everywhere differentiable. Finally, the ramp 
loss function is bounded. The last three functions can be viewed as an approximation 
of the 0-1 loss function. Graphically, the ramp loss function is the best approximation. 


E 

.75 

: e — 0-1 

3 =— Binary hinge 
50H K qhevece Squared hinge 

* T--: Ramp 

Zo 
00 = = Gan -e- 


0.50 


0.00 \ à MI as : à - zs 
—-1.0 —-0.5 0.0 0.5 1.0 1:5 2.0 


FIGURE 15.6: Comparison of SVM loss functions 


2. We have: 
Lem? (ari, ys) = min (1, 5S. (aj, y))) 
min (1, max (0,1 — y; (bo + xj 8))) 
= max (0, l—yi (Bo + vj B)) = 
max (0, —yi (Bo + x; B)) 
= LE (r; yi) — LOO CE au 


It follows that L"®™®P is not convex, making the optimization problem tricky. 


LS-SVM regression 
1. We note 0 = (bo, 6, £) the (1 + K + n) x1 vector of parameters. The objective function 


is equal to: 
1 m 
£0 = 3l8l5- C e 
i=1 
lam T 
= 3? Q0—0' R 
where: m " 
0 Of 07 
Q—| Ox Ix OK xn 


On OnxK 2C.l, 
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and R = 014K4n. Let Y = (yj) and X = (zix) be the output vector and the design 
matrix. The equality constraint is A0 = B where: 


A Ve X Iw) 
and B=Y. 


2. The associated Lagrange function is: 


£(fo.8,60) = jlBlbec d. 
$= 


n 


5 oi (yi — Bo — 2; B — &) 


i=1 


The first-order conditions for 69 and £ are: 


LAU ey ore, 
i=1 


ô Bo 
and: 
OL (Bo, B; a) - 
0B B 2. OTi K 
'The first-order condition for £ is: 
OL (Bo, B; a) =) x 


Since we have a! 1, = 0, 8 = X'a and £ = o/ (2C), the objective function of the 
dual problem is equal to: 


* 1 Z 
& (a) = zll +e et 
i=1 
Sam- oY ai- (Soe!) 8- Sook 
i=1 i=1 i=1 i=1 
1 — 
= P P OE aY — Bo: o'1,— 8 B—o'€ 
lx C =z T ala 
gage ee 26 
T T m oo 
= alY Ro qms 
a (o ) 48 
Finally, we obtain the following dual problem: 
â =  argmin a xx ede a-alY 
2 2C 


s.t. alln =0 


3. We also have B = X'@. In this problem, all the training points are support vectors. 
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We deduce that fg = n7! 37, (v = a7 8) and Ê = y; — ĝo — x] B. We verify that 
the residuals are centered: 


e-SVM regression 


1. We note 0 = (89, 8,€7, E+) the (1 + K + 2n) x 1 vector of parameters. The objective 
function is equal to: 


FO = ;leliecY E +64) 


i=1 
1 
= -0'Q0—8'R 
2 
where: 
0 ok 03, 
Q—| Ox Ik OK x2n 
02; O2nx K O2nx2n 
and: 
0 
» 0x 
BEN ghee 
—C-1, 


The constraints Bo + x; 8 — y; <e+& and yi — fo — x] B X e+ £7 are equivalent to 
Bo— xl B+E > —yi — £ and fo +x] B+ £f > yi — €. The matrix form C0 > D is 
defined by: 


and: 


The bounds €> > 0 and nd > 0 can be written as 0 > @ where @ = 
(—0o i lix; 0n, 0n). 


2. We introduce the Lagrange multipliers a; > 0, af > 0, A; > 0 and A? > 0 associated 
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the four inequality constraints. The associated Lagrange function is: 


LO = gleli«cY E +E) - 


i=l 


S az (e+ E — Bo - 278 +y) 
i=l 


X at (e+ & + Bo +2) 8- y) - 
il 


n 


Seo we 
i=l 


i=l 


The first-order conditions for 89 and £ are: 


and: 


We deduce that: 


and: 
B= XT (at — a`) 


The first-order condition for £^ and £* are: 


OL(-) = = 
= .1 = = =0 
JE- C-i n-a À x 
and: aL.) 
: — .1 — el ee + 
JE C-1,-a À 0, 


This implies that: 


It follows that: 


= x (o^ + at)! 1, — Po (a7 — at)" 1, - 
(a7 — at)" X84 (a — at) Ya 


n n 
Sai +o ater 
i=l i=l 
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We also have: 


Cy (& -*&) = CMi&g cM 
i=l i=l i=l 
= J (ar +A) +D (of X) 


i=1 i=1 
Since (a7 — at)" X = 6", the objective function of the dual problem becomes: 


Ll- e(a7 +a)" In + 8 8- (a7 -a*) Y 


Il 


£'() 


Il 


1 
8 8 € (a7 4 at)" 1, — (a7 — at)" Y 
Since we have A; > 0, A? > 0, C- 1, — a7 — A- = 0, and C: 1, — at — At = On, 
we deduce that a; € C and a+ < C. Finally, we obtain the following dual problem: 
1 
[à ,à*) =  argmin 5 (a7 — at)" XX! (a —at) + 


e(a tat) 1, 4 (a7 —a*)' Y 
1l (a^ —at)=0 
s.t. 0,<a <C-l, 
0, <at<C-1, 


3. We note 6 = (a^ ,a*) the 2n x 1 vector of parameters. The QP objective function is 
equal to: 


f(0)- 50° Q0— O'R 
KE EXPE 

a= ( SNe A ) 
-Y-e-l, 

Pur ( Y-—e-:1,4 ) 


The equality constraint is A0 = B where A=( 1} —1, ) and B = 0. The bounds 
are 07 = 05, and 0* = C - Lan. 


where: 


4. We have: 


The Kuhn-Tucker conditions are: 


min (a; ,€ £i — Bo zi | yi) —0 
min (afe £ + By +e) 8 — yi) =0 
min ee =0 
min (AFET) =0 


We also remind that C = a; +A; = af 4 Aj. The set SVT of negative support 
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vectors corresponds then to the observations such that 0 « o; « C. In this case, we 
have e --6; — bo — x] B -- y; = 0 and €> = 0. We deduce that fp = y; +£ — £] B when 
i € SV. The set SV? of positive support vectors corresponds to the observations 
such that 0 < af < C. In this case, we have € + £f + bo + x} B — y; = 0 and £f = 0. 
We deduce that fo —-yi—E-z] B when i € SY". Finally, we obtain: 


ô Desv- (vi TES alp) Uc P iesv* (vi lo x] B) 
o = 


Nsy- T Nsyt 


where ngy- and ngy+ are the number of negative and positive support vectors. 


5. If o; < C, we have A; > 0and €; = 0. Otherwise, we have A; = 0 and €; > 0. 
More precisely, we have £ + £j — Bo — x; 8+ yi = 0 or: 


& =- (yui +e- ĝo- a7 Â) 


If of < C, we have Af > 0 and &f = 0. Otherwise, we have Af = 0 and €7 > 0. 
More precisely, we have e+ £f + fo + x] B — yi = 0 or: 


Et = yi -e -— ĝo- z] B 
6. When € is equal to zero, the term e (a7 + at)! 1, disappears in the objective function 
of the dual problem: 


T 


-£*()2z(a —at) XX! (a —at)+(a —o*) Y 


By setting 6 = at — o^, we obtain the following QP problem: 


ô = argmin $3X X75 = 9 Y 
y. 115-0 
Bue dps ipee L udo 


The bounds are obtained by combining the inequalities 0,, < a7 < C - 1, and 0, < 
at < C- 1,. We have B = X à. The set SY of support vectors corresponds to the 
observations such that —C « ó; « C. It follows that: 


A 1 a 
Bo = — Ð (w - 278) 
"o ney às 


where nsy is the number of support vectors. Moreover, we have: 
& —1 D = -c) - max (o. - (vi — Bo — 2} B)) 
Et =1{6, B c) max (0, y — ĝo — 27 Ê) 


15.4.9 Derivation of the AdaBoost algorithm as the solution of the ad- 
ditive logit model 


The following derivation comes from Section 10.4 in Hastie et al. (2009). 
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1. The objective function is equal to: 


L (Bs), f(s) = DIL (yi Go-1) (9) + bife (2) 


i=l 


n 
T 5 e ico (#4) + Bs) fc) (x:)) 


i=l 


n 
= X wise VPI) (xi) 
i=1 


where the expression of Wi s is equal to: 


Wis = eT Yie- (Ti) 


2. Since we have yi fis) (xi) = 1 if yi = yi; and yi f(s) (zi) = —1 if yi FA Yi,s, we obtain: 


II 


n 
wise MOF) . 1 (yi = yis} + 
i—l 


L (Bs). ft) 


n 
5 wi ,e Po fo e) . 1 {yi Æ Yi s} 


j=] 
n 
= efo Ņ wis: 1 {yi = Yis} + 
t=1 
n 
cP) S wis {ys F yis) 
t=1 


We notice that: 


n n n 
x Wis:liyi = Yis} = 5 Wi,s — 5 wis l {yi Ż vis) 
i=1 i=1 


i=l 


It follows that: 


L (Bs); f) = e Ms) »3 Wis + (e%) = e Po) 5 Wis: 1 {yi X Yis} 
i=1 i=1 


n n 
= (eto + (e%) — e Pc) Xiz d I i E 2) Ja Wi,s 
el Wi,s i=l 


= ((e% - e Pc) Lis) + e Po) 5 Wi,s 
t=1 


where £(,; is the error rate: 


eS Wis l UE Vi,s) 
Yaa Wi,s 


3. It follows that the minimum is reached when: 


IL (Hs), fis) 
OB 


Lis) = 


=0 
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We have: " 
OL Vo. ft) = ((e&c i e A) Lis) _ e P) 2. Wis 


We deduce that: 
(eP + e P0) Ls) — e 9o = 0 


If we consider the change of variable a = e? > 0, we obtain: 


1 1 
(s 1)5o- i-o 
a a 


or: 
auc b eg) 
Lis) 
The solution is: 
«4 [12 Eo) 
Lis) 


The optimal value of 5(, is then: 


ee tede e 
gre 


. We have: 


s—1 
Hs) (x) = X fen fen (£) + Êi) f) (2) 
s'—l 
= ĝen) (2) + Bi fs) (2) 
It follows that: 
Wi s41 eT Ys) (2i) 
L e idco) Yi Bis) f(s) (#4) 
= wi e7” Po fe) (a) 


Using the fact that —y; f(s) (vi) = 2-1 (yi # ĝi,s}—1, the expression of w;,s41 becomes: 
Wisi] = Wi epo Mis e- Bc) 
= WwW; eus Mila e- Bc) 


where: 


'The normalized weights are then: 
wi ews Muss o7 Bc) 
PR wy ges Mw tdv s f e-o 
w; gees Miis) 


cd wy gers Tues] 


Wi s+1 = 
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5. The AdaBoost model can be viewed as an additive model, which is estimated using the 
forward stagewise method and the softmax loss function. However, there is a strong 
difference. Indeed, the solution f(, is given by: 


Âs) = arg min > wi ,e "lir fs) 
i=1 


because —yi bts) f(s) (xi) = 28 s) M (yi Z yis) — Bos) and: 


£ CEG = e Bo Sue eee 


i=1 


In the AdaBoost algorithm, the objective function for finding fis) is exogenous. 


15.4.10 Weighted estimation 
1. (a) We have: 
0 = arg max £u (0) 


(b) The Jacobian matrix is: 
Jy (0) 2 w! © J (0) 


where J (0) is the Jacobian matrix associated to the unweighted log-likelihood 
function £ (0) = $77, £; (0). For the Hessian matrix, we have: 


Hy (0) = ye w;H; (0) 


where H;(0) is the unweighted Hessian matrix: 


8? £; (0) 8? £; (0) 8? £; (0) 
80,00, 00,00:45(0 00,00K 
8?£; (0) 8? £; (0) 8?£; (0) 
H;(0)— | 00,00; 805 06, 80500y 
0? e; (0) 8? £; (0) oL: (0) 
06x00, 00K 00> 00k 00k 


2. (a) The least squares loss function becomes: 


n Ny 1 
Lw (0) = Nw; 2 (yj (2) = Vij)” 
i=1 — j=l 
We also have: ILa (6) 
gy = (6540 Gyv)' (w0 Z) 
and: ET 
23 | = (((Gy,y © Gyw) Y) © Gu (w@X) 


where the matrices Gy y, Gy, Gz,4, Z, X and y are those defined in Exercise 
15.4.7 on page 303. 
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The cross-entropy loss function becomes: 
Lw (8) = — 5 wi (yilny; (wi) + (1 — yi) (1 — ny; (a) 
i=1 


We also have: 


EL = (f Y) (w © Z) 


and: 


2620) - (((e-v)))ezea- 2) wox) 


where the matrices Y, Y, Z, X and y are those defined in Exercise 15.4.7 on 
page 303. 


The soft margin classification problem becomes: 
^ ^£ pda yn z 

{ôo pE} = argmin z Ela CY wiki 
i=1 


for i = l,...,n 


st { yi (Bo +a] p) >1-& 
T C0 


For the primal problem, the difference concerns the vector R: 


_( Oką 
a= (8) 
For the dual problem, the difference concerns the bounds: 


1 
á = argmin 5^ Ta ETE 


3i y'a=0 
ME 0, <a € Cw 


It follows that the support vectors corresponds to training points such that 0 « 
ai < Cwi. 

Hard margin classification assumes that the training set is linearly separable. 
So, there is no impact of weights on the solution. This is why it is impossible 
to introduce weights in the objective function of the hard margin classification 
problem. 


Appendix A 


Technical Appendix 


A.4.1 Discrete-time random process 


1. (a) We have zo € {0} and Fo = {0}. For t = 1, x can take the value 0 or 1. We 
deduce that xı € {0,1} and Fı = {{0}, {1}, {0,0}, (0, 13]. Similarly, we have 
x2 € (0,1, 2) and: 


_ S 10}, {1}, {2}, {0,0}, { 
fius { {0,0,0}, {0,0,1 


, 


{0,2}, {1,1}, {1,2}, \ 


0,15, 
1, {0,1,1}, {0,1,2} 


(b) We have! E [|X (t)|] = t,/2/m < oo. It s < t, we have: 


[X| F] = E[Xi1 + Er | Fs] 

= y |x. + eo Et—n | F.] 

- E[X, E] +E XA ern | Fo 
= Ts + 0 


= Ts 


We deduce that X; is a martingale. 
2. We have: 


Xt = Xite 
= ó$(óXi sad 61) + & 
= Xia t+ pEr tE 


oo 
TL 
x X @ Et—n 
n=0 
because limp, @” = 0. 


(a) We have: 


TSee Question 1(c) of Exercise A.4.2. 
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Vt € Z, we have E [Xi] = E [X zo $" Et-n] = 0. We deduce that: 


i [Xi] =E [Xs] 


V(s,t) € Z with t > s and Vu > 0, we have: 


[Xs f uXt f A = i p» Pr Estu—n 5 dence 
n=0 n=0 


= aps Q" Estu—n 5 Sern 
n=0 


n=t—s 


oo oo 
~ t— 
m 4 X Q" Estu—n X [o) STU buc 
n=0 


n=0 


= g ?E pe Lm 
n=0 

gis " 

1-42? 

= E[X,Xj] 


We deduce that .X, is a weak-sense stationary process. 


(b) We have: 
POS eA - f vi- g | Coe) ae 
A 


o? y 2T Sap 20? 
The probability IP {X; € A} does not depend on t. We deduce that: 
P{X,€ Ay =P{X, € A} 


(c) We have: 
E[X: | Fi} = E[éXi-1+ & |i] 
= mra 
* Tt—1 
It follows that X, is a Markov process only if ¢ is equal to 0. 
(d) We have: 
j [x?] = J [2 + 853] 
= (146%) 0? 
< oo 


Vt € Z, we have E[X,] = E [e; + 02, 1] = 0. We deduce that E[X;] = E[X,]. 
V(s,t) € Z with t > s and Vu > 0, we have: 


[XspuXt+ul = E[(€stu t+ O€s4u—1) (€t t+ O€t+u-1)] 
= E [es4uétpu] + OE [Es+u—1Et+u] + 


So 


) [EspuEt+u 1] + 6? i [Es+u 1Et+u 1] 
(1+0?) o? ift=s 

= 0c? if t- s|=1 

0 if t- s|>1 


A.4.2 Properties of Brownian motion 


1. 


Technical Appendix 


Since we have E[X.4uXt+u] = 
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2 [Xs X+], we conclude that X, is a weak-sense 


stationary process. It is easy to show that the probability P ((.X, € A} does not 
depend on t. This implies that X+ is a strong-sense stationary process. We have: 


S [Xs | Fu] 


II 


+ 
4 


i [es + 0641 | F1] 


Oer—-1 


eii + 064.5 


Tt—1 


The MA(1) process is not a Markov process. 


(a) We have: 
5 [W (t)] :[W (t) — W (0)] 
= 0 
(b) We assume that s < t. We have: 
cov (W (s) W (t)) = E[W (s) W (t)] 
= EIW (s) (W (t) - W (s) + W (s))] 
= E[W (s) (W (t) - W (s))] + E [W° (s)] 
= E[(W (s) - W (0) (W (t) - W (s) + 
(W (s) - W(0))?| 
= 0-cs 
(c) We have: 
; zog ab rus 
(|W ey] = a OR d 
= 2 | ie da 
2 Ldbg d 
eee 
E 
< a 
and: 
[W (t) | Fs] [W (s) + (W (t) - W(s)) | Fs] 
= E[W(s)| Fs] +E[W (t) - W (s) | Fs] 
= w,+0 
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2. We only consider h — 0* because the case h — 0^ is symmetric. We have?: 


P(W(--h)-W(»z = ae ] exp ( z”) dis 


-œ V2Th 
my ud 1 
= 2 | ec dk (haere) Vh dy 


ll 
ins 
oO 
Ps 
ue) 
ure 
S| M 
N 
— a 
= 
g (æ) 
[ey 
| 
SR 
| 
si 
e 
d 


We note f (y) = zd — T We have: 


ovh 


Therefore, f (y) is an increasing function on |—o0, —h~1/?e] and a decreasing function 
on [—h-1/2e, 0]. We deduce that: 


and: 


where C € R+. It follows that: 


2 
0c P(W(t-h)-W(t)|-se& Cy ex (-=) 
and: 
2 e? 
< li = < ME Li 
0< ,im, P{W (t +h) W (t)| >e} < Cy im exp ( x) 
Since lim; ,9« exp (-&) — 0, we deduce that: 


Jim, P{W (t-+h) -W (| > e} - 0 


3. We have: 


[W (t)] 2 t 


and: 


[WEAF] = E[(W(t)-W(s)+W(s))?|F] 


Il 
© 
I 
T 
oO 
+ 
— 
~œ 
| 
WD 
— 
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If X ~ N (0,1), we know that E[X?] = 0 and E[X*] = 3. We deduce that 
z [W° (t)] = 0 and: 


: [W^ (¢)] 


II 


: (e 0.2) 


= 30? 


We remind that E [exp ( (u, o?))] = exp (u + 0.507). We deduce that: 


y je" | = eàt 


and: 


J |." 


i] E: Add W (s) 


z] 


eW (8) ; [ee W(s) 


z 


eW(s) e} (t-s) 


= eł(t-s)W(s) 
4. If X ~N (0,1) and n € N*, we have: 
[X] = (25-1) [xe » 
= (2n—1)(2n—3) | xim 2| 
(2n — 1) 2n —3)---5-3-1-E [X?] 
= (2n—1)M 
and: 

t [X] 20 


where n!! denotes the double factorial. We can also show that: 


It follows that: 


and: 


[w^ (5| 20 
For an even integer n, we deduce that: 


n! 


~ 2°72 (njg)l 


p 


[W'" (t)] 


whereas for an odd integer n, E[W™ (t)] is equal to 0. 
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A.4.3 Stochastic integral for random step functions 


1. We note (*) = f? (af + Bg) (t) dW (t). We have: 


(*) 


BY g (ti) (W (tis) — W (62) 


p ] 
af re awas | g(t) aW (t) 


We conclude that the stochastic integral verifies the linearity property. We now intro- 
duce the following partition: 


We deduce that: 


b 
f f(t) aw (t) 


a=to «ti «c «tj-cc«c <tn <b 


=. f (ti) (W (tig) — W (t) 
1=0 
k—1 
= XOS) (W (tis) - W (8)) + 
i=l 
Y F (ti) (W (tis) - W (62) 
rc i 
- fto aw (+ f f(t) dW (0) 


It follows that the Chasles decomposition property holds. 


2. We have: 


p 
|y Ho are) = |E rowem- wn 


i=0 
= x [f (ti) (W (t3) — W (ti))] 
i=0 
ah [f£ (6)] E [W (t3) - W (t;)] 
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We note (+) = e f(t) aW (t) f? g(t) aw (t)]. We have: 
DN [i rte ) (W (5543) - MALAM 
5 i—0 
z [E ^ f) g (ty) QV (tint) - W (5) (W (53) ve) 
4—0 7=0 


We deduce that: 


This result is known as the Itó isometry property. It is particularly useful for comput- 
ing the covariance between two Itó processes X; (t) and X» (t): 


b b 
cov (X1(t), Xi (t)) = E / ai (t) dW ef c» (t) dW | 


b 
= Af CO d 


where c (t) and o» (t) are the diffusion coefficients of X, (t) and X» (t). 


3. We remind that: 


332 Handbook of Financial Risk Management 


b b 
«(| rowo) - J [read -o 


Since the mathematical expectation and the Riemann-Stieltjes are both linear, we 


conclude that: 
b b 
var | / f (t) ave) - i : [f? (t)] at 


A.4.4 Power of Brownian motion 


It follows that: 


1. We apply the Itó formula with u (t,x) = 0, o (t, x) = 1 and f (t,x) = x”. Since we 
have ô; f (t,x) = 0, 0, f (t, x) = 2x and 02f (t, x) = 2, we deduce that: 


dw? (t) df (t, W (t)) 


Il 


II 


(0+2wx0+5x2x1) dt + (2W (t) x 1) dW (t) 


II 


dt + 2W (t) dW (t) 


2. It follows that: i ; t 
J awos [ ast [ wy awe) 
0 0 : 


who cea [we dW (s) 


and: 


Therefore, we obtain: 


I 


T(t) n W (s) dW (s) 
= 5 (W? (1) -1) 


If follows that the expected value is equal to: 


|f wo) aw) 


I 
FL 
| 
= 
N 

— 

+ 
— 

l 

~œ 

nN 
[m | 


Concerning the variance, we obtain: 


var ([ wo aw (s)) = var (5 (W? (0) ~1)) 


wl 
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We also notice that we can directly find this result by using the It6 isometry property: 


ae ([ wo aw (s)) (fre ares) | 


t 


II 


£ [W? (s)] ds 


3. We use the function f (t,x) = x”. We have ôf (t,x) = 0, Onf (t, x) = nz and 
02 f (t, x) = n (n — 1) z"-?. The Itó formula gives: 


aW” (t) = df(t, W (8) 
= sn (n—1) W(t)" dt + nW"-! (t) dW (t) (A.1) 


4. Since I, (t) is an Itó integral, we have: 


and?: 


var (I, (t)) = [i z [W?" (s)] ds 


| 
2—,. 
NIT 
N 
|= 
3 
e 


1 $ 
= (27)! s" ds 
2"n! Jo 
(2n)! [e 
2^ (n 4- 1)! 


Finally, we obtain the following values of var (I, (t)): 


n l1 2 3 4 5 6 
I5 315 l 
var(I,(t)) it? t Bet 210 3220 14850 


5. In Question 4 of Exercise A.4.2, we have shown that: 


[44 (0] = E[W” (t)] 
Feat if T is even 
if n is odd 
Let us assume that n is odd. We have: 
var (J, (H) = E[W?"(t)] 
Qn)! n 
2nq! 


3We use the result obtained in Question 4 of Exercise A.4.2. 
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In the case where n is even, we deduce that: 
var (Jn (t)) = E [Ww (t)] — E? [W^ (t)] 
| (Qn) nt \?\ 4” 
(n)! (n/2)! 7} 2” 
6. We have: 
t 
[K,(t)] = E i W” (s) as| 
0 


L[W" (s)] ds 


f 


If n is odd, we deduce that: 


II 


t 
I Ods 
0 


Il 
o 


If n is even, we deduce that: 


i [Kn (£)] 


t n! 
* n/2 
| FRE 05 
n! 


t 
es VERI 

s? ds 
27/2 (7/2)! I 


n! | s”/2+1 | 
27/2 (n/2)! [n/2+1] 9 
n! 


2/2 (v T1) 


urs! 


7. Concerning the second non-central moment, we have: 


fen] 


| [K2 (t)] 


| 
Lf [whey to dsdu 
MA 


by using Fubini's theorem. The challenge lies in computing the term 


L[W" (s) W” (u)] dsdu 


(f wa) (f wo au) 


We face two difficulties. First, W” (s) and W” (u) are not independent. Therefore, the 
covariance involves the power series of W (s). Second, we must distinguish the case 
s < u and u > u. This is why it is long and tedious to compute the variance for high 


order n. 
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8. In the case n = 1, we obtain: 


Sy n iW (s) W (u)] dsdu 
= f Goin. u) ds) du 
- f C mintsu ) de f min (s) as) du 
P du 


dee 


var (Kı (t) E [KT (0] - E? [Ks (0) 


1 
3 
For n — 2 and s « u, we have: 
W? (s)W?(u) = (W (s) — W (0))? (W (u) - W (0))" 
= (W (s) — W (0))? (W (s) - W (0) + W (u) - W (s) 
= (W (s) — W (0))* +2 (W (s) — W (0)? (W (u) — W (s)) + 
(W (s) — W (0))? (W (u) - W(s))? 
Since W (s) — W (0) and W (u) — W (s) are two independent random variables, we 
obtain 


33 +2-0+s(u—s) 


2s? + us 


We deduce that: 
(e) = [ f ED Ww? qu sau 


t u t 
= f if (28? + us) as+ f (2u? + us) as) du 
0 0 u 
t (T2 beol 1 x] 
" Ts -L-us| + |2u?s + Td du 
0 3 2 0 2 u 
: 1 4 
= i (suni jut — zu) du 


1 1 1 
3 2,2 4 
t4 t 
u ig zl 
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and: 
var (K2 (t)) = E[K2(0] - E? [K2 (t)] 
Pad 9k e 
BE de (sat ) 
1 
= at 
For n = 3 and s < u, we have: 
W? (s) W? (u) = (W (s) — W (0)? (W (u) — W (0)? 
= (W (s) — W (0)? (W (s) — W (0) + W (u) — W(s))° 
= (W (s) — W (0)? +3 (W (s) - W (0)? (W (u) — W(s)) + 
3 (W (s) — W (0)* (W (u) - W(s))? + 
(W (s) — W (0))? (W (u) — W (s)? 
It follows that: 
. 6! 4 5 
i [W> (s) W? (u)] = (°) +3-0+3- Car ) (u—s)+0 
= 6s° + 9us? 
We deduce that: 
[KO] = | : [W3 (s) W° (u)] ds du 
0 


so So So FS. 


u t 
(65? + Jus”) ds «f (6u? + 9u?s) as) du 


t 3 u 9 t 
= E + us + outs + d du 
2 0 2 u 
t 9 
= (~ou + 6u?t + swe) du 
6 3 : 
= w+ Kutt + ube? 
5 2 6 
= 95 
5 
and: 
var (Ks (t)) = E[K3(t)] - E’ [Ks (t)] 
9 5 
= =t 
5 
9. Using Equation (A.1), we deduce that: 
t 
Bs J W” (s) dW (s) 
0 
1 n 
= n+l mM n—1 
= xp (t) JA W^ (s) ds 
1 n 
= net (t) — 554 (t) 
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and: 
m(t) = W(t) 
1 i n—2 É n—1 
= sn(n—1) f W (den f w (s) dW (s) 
= n0 DK, s) + nha () 


We also have: 


2 


(n4 2)n4.1) ^? (Ejus mj" (t) 


A.4.5 Exponential of Brownian motion 


1. 


We apply the Itó formula with u (t,x) = 0, o (t,x) = 1 and f (t,x) = e”. Since we 
have ð; f (t,x) = 0, 0, f (t, x) = e” and 02f (t, x) = e”, we deduce that: 


de"(0 = qf(tW (t) 


= sen dt + eW qw (t) 


and: 


t Dor t 
| deW‘) = J eW (s) ds +f eW (s) dW (s) 
0 2 Jo 0 


It follows that: 
t t 
eW( =1+ ;] eW (9 ds «f eWG) AW (s) 
0 0 


Or: 


X(t-214 ;Y (t) 4 Z (t) (A.2) 


. X (t) = eW() is lognormal random variable with E [X (t)] = e2* and var (X (t)) = 


e?* — et. In the case Z (t) = ds eW (5) dW (s), we have: 


and: 


var(Z(t)) = 


Il 
alia e E 
[u 
N 
N 
Q 
U 
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In the case of Y (t) = i eW (s) ds, we have: 


and?: 


uv = n : [e| as 


I 
=~ 
c 
Mp 
w 
Q 
wD 


M 
Y 
N 
zm 
Il 
1-———CH 
umm 
om 
ay) 
= 
o 
(em 
w 
Se 
Lamm 
oo 
c 
p 
= 
a 
2 
NV 
———— 


d. ; e" reme dsdu 
0 


t 
| e2 (st+ut2min(s,u)) Js du 


u t 
| elas f ehta ds) dii 
0 u 


II 


I I 
SDN EET ~ SO SO SS 
e CONLIN AT TUN 
$ 
w 
e 
® 
N 
g 
l 
| 
e 
NI 
TENE 


c 
N 
x 
l 
wI 
c 
Lv 
En 
+ 
N 


It follows that: 


var (Y (t)) = |( Faro a) | -E | f i eW (9) as| 
2 8 


Il 
a 
N 
S 
A 
Q 
= 
ze 
a 
Nik 
c 
N 


4We have 


E [W (s) + W (u)] = 0 and: 


var (W (s) + W (u)) 2 s +u + 2min (s, u) 
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3. From Equation (A.2), we deduce that: 


cov (Y (t), Z) = Í var (Y (0) + var (Z (0) — var (X (t)) 
1/2 o t, l6 2, B 
= G 4e 3° 2) +5 (6 1) 
(e et) 
1», 4 ito 
= —3* Tuus 1 


Finally, we obtain: 


Vae?! + 4fse'? — 1 
(se — 2e! + 8/se'/? — 1) (e?t — 1) 


In Figure A.1, we have reported the correlation p(Y (t), Z (t)) with respect to the 
time. 


-0.55 


—0.60 


—0.65 


—-0.70 


—0.75 


—0.80 


—0.85 


FIGURE A.1: Correlation between fn e" 9 ds and fy eW 9 dW (s) 


A.4.6 Exponential martingales 
1. We have z, = X (s) = eW (°) and: 
[ewe | Fal = 1 [ewe | F] 


Lo eW(s) [e t. we E 


2 eW(s)e} (t-s) 


£z Ts 
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Therefore, X (t) is not a martingale. 


2. The previous question suggests that m (t) is equal to e~2*. We have: 


M(t) = e 2teWH) 
eW(0- 8t 


It follows that: 


£[M (t) | Fs] 


II 


JE Pu 


= fe W(s)+W(t)— "eb x, 


pu M 
— W(s)—is 
M (s) 


Il 


3. By applying Itó's lemma with f (t, y) = e", we obtain: 


dM (t) 


Il 


I 


M (t) g (t) aW (t) 
It follows that: 


I 


puo 


= [io s) dW (s) 


M (t) — M (0) 


Since g (t) is not random, we deduce that Y (t) is a Gaussian process. We have: 


and: 


We deduce that: 


[M (t) | Fs] 


II 


i r Bf, Pm du f se awe | z| 


a is -3 fe 9? (u) dut f^ 9( (u) AW(u) |. 


e BJ) 9P) dut [7 glu) dW Q0 EJ 


= M (s) "EM g^ (u) duxi f’ 9? (u) du 
= M (s) 


We conclude that M (t) is a martingale. 


1 1 
(-5€ 69^) + 5e? (0) ac 090) aw t) 


(A.3) 
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4. We notice that M (0) = 1. From Equation (A.3), we have: 
t 
M (t) 21 «f M (s) g (s) dW (s) (A.4) 
0 


Since qd M (s) g (s) dW (s) is an Itó integral, we deduce that M (t) is a .F;-martingale. 
We say that M (t) is the exponential martingale of X (t) 2 g (t) and we have: 


[M (t)] = 1 


We also notice that Equation (A.4) is related to the martingale representation theo- 
rem: 


M(t) = i (0) f f (s) AW (s) 


where f (s) 2 M (s) g (s). 


A.4.7 Existence of solutions to stochastic differential equations 


1. We have u (t, x) — 1 -- x and ø (t, x) = 4. It follows that: 


In(5,z) —B(t,y) = li-z-1-y| 
< beg] 
and: 
lo (t,x) o (ty) -|4-4| = 0 
< ew 


We deduce that Kı = 1. Using the Cauchy-Schwarz inequality, we also have: 


In(52) - Hx] S nbl 
€ 4-(1- |z|) 


and: 
lo (t,x) = 4 
< 4-(1+ |z]) 


We deduce that Kə = 2. We deduce that there exists a solution to the SDE and this 
solution is unique. 


2. We have: 
Ip (tz) - n(t,y)) = la(b— x) - a(b— y)| 
= Ja]: |z -v 
and: 
lo (t,x) —o(t,y)| = lex — cy 
= |el-|e- 


By applying the Yamada-Watanabe theorem with K = |a| and h(u) = |c| u, we 
conclude that the solution exists and is unique. 
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A.4.8 It6 calculus and stochastic integration 
1. We consider the transform function: 
Y() = f(&X() 
(1-- t) X (t) — X (0) 
We have ô; f (t,£) = x, O,f (t,£) = 1 + t and 02f (t,x) = 0. It follows that: 


aea = (x@-a+9 12) dte (12) aw (ë) 
= dW (t) 
and Y (0) = X (0) — X (0) = 0. We deduce that Y (t) = W (t) and: 
_ X (0) - W (t) 
X (t) — TOSS 


2. Using f (t,z) = zg! — x71, we have Of (t,x) = 0, Oxf (t,£) = x? and 02f (t,x) = 
—2x~3. It follows that: 


E X(t) TIA) , X? (t) 
wo = Gaio) tego 
= dW (t) 
and Y (0) = 0. We deduce that Y (t) = W (t) and: 
1 
T0-7xcgy-wg 


3. We have: 


4. We have f (t,x) = (1—12)! z, &f (tz) = (1—t) ? x, O,f (t, zx) = (1— 1) * and 
02 f (t,x) = 0. It follows that: 


z X (t) X (t) -a 
dy (t) = (S zx dt + —7,dW (t) 
1 
zou dM 


We deduce that: , 
1 
Y (t) — Y (0) =} dW (s) 
0 1 ="S: 


Since we have X (0) = 0, it follows that Y (0) = 0 and: 


ra= aw (s) 
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5. Using Ito’s lemma, we have: 
dX (t) = (asewo) + zr ew i) dt + 0, f (t, W (t)) aW (t) 


Since X (t) is a martingale, it satisfies the martingale representation theorem: 


t 
X (t) 2 E[X (0)] +f Z (s) dW (s) 
0 
where Z (t) is a F;-adapted process. We deduce that: 
Z (t) = Oxf (t, W (t) 


and: 


OFW) | 109 f(,W (0) 
Ot 2 Oz? 
Then, X (t) is a F;-martingale if this condition is satisfied. 


=0 


6. In the case of the cubic martingale, we have f (t,x) = z? — 3tz, 0,f (t,x) = —32, 
0, f (t, £) = 3x? — 3t, 02 f (t, x) = 6x and: 


8f(,W(t) , 10? f (,W (t) 
Ot L Ox? 


= —3W (t)4 Sow (t) 
= 0 


In the case of the quartic martingale, we have f (t,x) = xt — 6tz? + 3t?, 0,f (t,£) = 
—6x? + 6t, zf (t, x) = 4r? — 12tx, 02 f (t, x) = 12x? — 12t and: 
18? f (t, W (t) 


8 f (t,W (t)) | B 1 
3 D = —6W* (t) + 6t+ 5 (12W* (t) — 12t) 
= 0 


We conclude that the necessary condition is satisfied for the cubic and quartic mar- 
tingales. 


7. We note f (t,x) = e'/? cos (x). Since we have Of (t,x) = le? cos (x), Oxf (t,£) = 
—e'/? sin (x), 02 f (t,£) = —e'? cos (x), we obtain: 


dX (t) = (se^ cos W (t) — je cos W (9) dt — e'? sin W (t) dW (t) 


It follows that: j 
X(t)=1 -f e*/ sin W (s) dW (s) 
0 


X (t) is an Itó integral. Moreover, we verify the condition: 


Jn ES 


€*? sin W (s) 


|^ 
T-————Tfí 
S= 
m 
e 
ü 
Q 
D 
LIU 


^ IA 


Then, we deduce that X (t) = e’/? cos W (t) is a martingale. 
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A.4.9 Solving a PDE with the Feynman-Kac formula 


1. The function g (t) = 1 satisfies the Novikov condition: 


ien (s[ 7 (s) as) — e? « oo 


We deduce that Z (t) = W (t) — te ds is a Brownian motion under the probability 
measure Q defined by: 
dQ t 1 ft 
dW (s) — = 
JP exp ( | (s) 7 1 ds 


Since we have dZ (t) = dW (t) — dt, we finally obtain that: 


dX(t) = dt+dZ(t)+dt 
= 2dt+dZ(t) 


. Under the natural filtration F+, we have: 


X (5) = X (t) + (5 — t) + (W (5) -W (t) 


and?: 


[X (5)| A] = x + (5 — t) 
If we now consider the filtration C, generated by the Brownian motion Z (t), we obtain: 
X (5) = X (t) +2(5 — t) + (2 (5) — Z(t)) 


and: 


[X (5)| Ge] = 2 +2 (5 — t) 
We deduce that: 


[X (5)|Go] = z+2(5-0) 
= «+10 


. We notice that: 


—ÓV (t, x) + 10V (t, x) Æ AV (t,2) 4-4 


where A; is the infinitesimal generator of X (t) with respect to the filtration Fy. 
'Therefore, we cannot apply the Feynman-Kac formula. However, by changing the 
probability measure, we have: 


avet Gay = jV (t,2) +20,V (t,2) +4 
= AV(t,x2)+4 


where A; is the infinitesimal generator of X (t) with respect to the filtration G+. We 


5We note X (t) =a. 
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can then apply the Feynman-Kac formula and we have: 


V(t,c) = E xok” Pale iud ase. 


t 


5 
= e36-9.E[X (5) G] «f 4e 36-9 qs 
t 


4 5 
= e 36-0, [X (5)| Gi] E -$even 
3 t 
4 
= (x+10-— 2t) e% + à (1 =: e3t-15) 
given that X (t) = x. We check the terminal condition: 
4 
V (5, x) = (x +10- 2x 5) e3X8-15 , a (1 _ e?X5=15) 
= 
We also have: 
&V(tz) = 3(x +8- 2t) e715 
a,V (t,2) = eX 


It follows that V (t,x) satisfies the PDE: 


—O,V (t,£) + 3V (t, 2) 


Il 


—3 (x +8 — 2t) £715 + A (1 — et719) + 
3 (£ + 10 — 21) e715 
= 2e3t-15 1 4 


1 
= zV (t,£) -- 20,V (t,£) +4 
. If the terminal value is V (T, x) = e”, we obtain: 
53a : "3d 
V (t,£) = Bled j f (ef. 3 j ds 
t 


e 36-9 . [e| 2 4 ; (1 m qs) 


a 


We have: 
X (5) |G ~N (x + 10 — 2t,5 — t) 


We deduce that: 


l 
a 


: E A — QUH10-2t-1(5-10) 


@+12.5—2.5t 


6 At the initial date t = 0, we have: 


5.4 5 
V (0,2) = (x 4-10)e715 + = (1 — e715) 
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We finally obtain the following solution’: 


V(t,z) = e 36-5 , pv +12.5—2.5t |. 4 (1 E pem 
f 3 


z— 4 — 
= gf-25405t 4 3 T 


We check the terminal condition: 


V (5, x) ET et 2-5+0.5x5 a : (1 p p 
= e? 
We also have: 
OV (t, c) = 0.5e%—2-5+0.5t m 4e3t—-15 
O,V (t, x) = et 2-5+0.5¢ 
aV (t, x) = et 2-5+0.5¢ 


It follows that V (t,x) satisfies the PDE: 


2 5e*— 2-940.5t deb 
= 0.5e*-2-540.5t KO Qet—2.5+0.5t A 4 


—O,V (t,x) + 3V (t, x) 


= lav (t,£) + 20,V (t,£) +4 


A.4.10 Fokker-Planck equation 
1. If we consider the following PDE: 


{ —ÓV (t,x) = $0?82V (t, x) + (a(b — x)) OV (t, x) 
V (7,2) =1{2=27} 


the solution is given by the Feynman-Kac formula: 


V(&z) = EB(X(T) = {xr} |X (t) =a] 
= P(X(T)-er| X(t) =2} 


We have: 
9, [a (b — x) U (t, x)| = —aU (t,x) + a (b — x) GU (t, x) 


and: 
82 jor EL (t, x)| 67020 (t, x) 


We deduce that the Fokker-Planck equation is: 


{ OU (t,x) = aU (t,x) — a (b — £) O,U (t,x) + 30?02U (t, x) 
U (0,2) = 1 {a = xo} 


7 At the initial date t = 0, we have: 


V (0,2) =e? 2 + : (1 — e715) 
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In Figure A.2, we have represented the probability density function P {X (1) = x | X (0) = 0} 
using the two approaches. For that, we solve the two PDE using finite difference meth- 
ods. Let uf, be the numerical solution of U (ti, £m). By construction, we have: 


ul, = P{X(t) = £m | X (0) =0}- dz 
= P{X (ti) = £m | X (0) =0}-h 


where h is the spatial mesh spacing meaning. Therefore, we have to divide the numer- 
ical solution by h in order to obtain the density. 


Analytical solution 


0 , , . n 
-0.4 -0.2 -0.0 0.2 0.4 0.6 


Feynman-Kac solution Fokker—Planck solution 

3.5 3.5 
3.0 3.0 
2.5 2.5 
2.0 2.0 
1.5 E 
1.0 1.0 
0.5 0.5 
0 0 


.0 .0 
-0.4 -0.2 -0.0 0.2 0.4 0.6 -0.4 -0.2 -0.0 0.2 0.4 0.6 


FIGURE A.2: Density function of the Ornstein-Uhlenbeck process 


2. The solution is given by the Feynman-Kac PDE: 


—ÓàV (t, x) = $a?a?82V (t, x) + uxd,V (t, x) 
V (7,2) =1{« = zr} 


and the Fokker-Planck equation: 


{ &U (t, £) = $072? O2U (t,x) + (2072 — ux) OU (t, x) + (o? — p) U (t, x) 
U (0,2) = 1 {a = zo} 


because: 
Ox [uU (t, x)| = U (t,x) + uxd,U (t, x) 


and: 
82 [o?2?U (t, z)] = 207U (t, x) + 4o?xÓ,U (t, x) + o?z?O2U (t, x) 


In Figure A.3, we have represented the probability density function P {X (1) = x | X (0) = 0} 
using the two approaches. 
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FIGURE A.3: Density function of the Geometric Brownian motion 


A.4.11 Dynamic strategy based on the current asset price 


1. We have: 
dV (t) = n(t)ds (t) 
= f(S(t)nu(tS(t)) dt + f(S()) e (t, S (t)) dW (t) 
2. We notice that 0,F (x) = f (x) and 0?F (x) = f' (x). Using Itó's lemma, we have 
wo = (Sn. se) + r es) dt + 
OF 
ag? tt S (t)) dW (t) 
We deduce that: 
dY (t) = f(S (8) u (t, S(t)) dt + f (S (t) e (t, S (£)) aW (t) + 
5f (S (t) e? (t, 5 (0) at 
3. Since we have: 
dy (t) = dV (t) + jf (S (t)) a? (t, S (t)) dt 
it follows that: 
dV (t) = dY (t) — = f' (S (2) e? (t, S (£)) dt 
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We deduce that: 


V (T) —V (0) 


T 
¥(T)-¥(0)-5 | PEPES at 


II 


1 T: 
F (S (T)) - F (S (0)) - ;] F (S (0) 0? (t, S(t) at 


Finally, we obtain: 


S(T) T 
v(r) = vof f()dz- 5 | f' GS (0) 0? (t, S (1) at 


(0) 2 
= G(T)4+C(T) 
where: 
S(T) 
G(T)=V(0 «f f (a) da 
S(0) 
and: 


T 
cqm--g[ PEPES A 


The first term G(T) can be interpreted as the option profile of the dynamic strategy 
at the maturity date, whereas C (T) is the cost associated to the continuous trading 
strategy. 


. IEf(S(t)) 2 1(S (t) > S}, we have: 


iem f S(T)-S0) #5250) 
[, 1 {a> 8.) de={ S,-S(0) ifS(T)<S, 


and: 


F (2) = -ô (x — S,) 
where ô (a) is the Dirac delta function. By assuming that V (0) = S (0), we deduce 
that: 
1 T 
V(T) = S(T) +(S, —S(T)), - J ô (S (t) — S,) o? (t, S (t)) dt 
0 


'The option profile of this strategy is the underlying asset plus a put option where the 
strike is equal to S*. The cost of the stop-loss strategy is equal to: 


T 
C(T) = y 5(S(t) — $,)o? (t, S(t) dt 
« 0 


. IEf(S(t)) 2 1(S (t) < S,}, we have: 


S(T) _f S(T)-S(0) if S(T) < S. 
i L{e<5,} de={ S, — S (0) if S(T) > S, 


and: 


F (£) = 6(a—S,) 
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Use the results of the previous question, we have: 
1 T 
V(T)2S(T)-(S(T)-S), + F ô (S (t) — S.) o? (t, S (t)) dt 
0 


The option profile of the stop-gain strategy is the underlying asset minus un call 
option where the strike is equal to S,. The cost of the strategy is positive, because 
if we cross the gain level (S (t) > S,), we obtain an additional positive P&L that is 
equal to S (t) — Sx. 


6. (a) We buy the asset (n (t) > 0) when the asset price S (t) is below the price target 
S,. And we sell the asset (n (t) < 0) when the asset price S(t) is above the price 
target S,. This is a contrarian or mean-reverting strategy. 


(b) We have: 
S(T) 
[me ES: dx = mS, m — =| 
S(0) ud *4 (0) 
S(T) 
= m$,ln 5 m (S (T) — S (0)) 
and: g 
f (2) = -m5 
We deduce that: 
V(T)-V (0) = mS, (In S (7) — In $(0)) — m(S(T) —S(0)) + 
m T g? (t, S (t)) 
25 J PA Y 
(c) When we have c (t, S (t)) = o (t) S (t), we obtain: 


S(T) 
S (0) 


m T 

V (T) — V (0) = mS, In -m(S(r)- 50) 75. | c? (t) dt 
0 

and: 
C(T)= TS IV (T) 


where IV (T) is the integrated variance: 


(d) When the asset volatility o (t) is low, the trend of the asset price is strong. It 
means that the asset price can continuously increase or decrease. This is the bad 
scenario for the strategy. The good scenario is when the asset price crosses many 
times the target price S,. This is why the strategy is more performing when the 
realized volatility is high. In Figure A.4, we have illustrated the strategy when 
the asset price follows a geometric Brownian motion and the target price is equal 
to the initial price’. We notice that the number of times that S (t) crosses S, 
increases with the volatility. Therefore, the vega of the strategy is positive. 


8We have S, = S (0) = 100. 
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p= 5% & o = 40% u= -57 & o = 402 
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FIGURE A.4: Impact of the volatility on the mean-reverting strategy 


A.4.12 Strong Markov property and maximum of Brownian motion 
1. We have: 
Pr (W (t) > z} = Pr(W (t) > z, M (t) > a} + Pr(W (t) 2 z, M (t) <z} 
Since M (t) > W (t), we have Pr (W (t) > z, M (t) < x} — 0 and: 
Pr(W()zz) = Pr{W (¢) > z, M (t) > 2} 


= Pr{W(t)>x|M(t)>x}-Pr{M(t)> x) 
Pr(W (t) > £ | Ta € t$ - Pr(M (t) 2 x} 


Using the strong Markov property, we also have: 


Pr{W (t) > z|v, € t] Pr(W(t) — W (tx) > 0| tT. € t) 
= Pr{W(m+t-—t)—W (tz) >0| Ts < t} 


= Pr(W(t—7,) >0| Tt: € t) 


We deduce that: 


and: 
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2. Let z = x — y 2 0. We have: 


Pr {W (t) 22r—9y)] = Pr(W(t) 2x-z,M(t) 2x) 
Pr (W (t) 2 x -z, M (t) <a} 
= Pr(W(t) 2x-c-z,M(t) >a} 


Indeed, we have: 
Pr(W (t) 2 z - z M (t) « 4 20 


because z > 0. It follows that: 
Pr{W(t)>2e—-y} = Pr{W (t) >+ z,M (t) > 2} 
= Pr(W(r,-t—7,) -W(r;)) > z| M(t) 2 x)- 
Pr(M (t) > a} 
Using the strong Markov property and the symmetry of the Brownian motion, we 
deduce that: 
Ty +t —T,) — W ( 
Tr +t — Tx) — W ( 
= Tr) > 2 | Tx €t] 
— Tr) < —z | Te < t} 
t)<a—2z|M(t)>2} 


) 
) 


z| M(t) 2x) 
z| Tx <t} 


Tx) 2 
Tx) 2 


We conclude that: 

Pr{W(t)>20-y} = Pr(W(t) <2- z |M (8) > 3] -Pr(M (0 » 2) 
Pr(W (t) € x ^ z, M (t) > x) 
Pr (W (t) € v, M (t) > x) 


II 


3. The joint density function of (M (t) , W (t)) is defined as follows: 
9? Pr(W (t) < y, M (t) 2 x) 


We have: 
Pr(W(t) «y, M(t) >} = Pr{W (t) 22z— y) 
= 1-0 (2524) 
vit 
We deduce that: 
Ə Pr(W (t) €x y, M (t) > x] 2 2x — y 
T wu) 
It follows that: 
= o 2 2: — y 
Tags A A vt )) 
_ 2 9 (2z — y) 
- aso) 
) 


|. (x-y 2 (2x — y)? 
in 13/2 nm 2t 
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4. WÊ (t) = ut + W (t) is a standard Wiener process under the probability measure Q 
defined by?: 


= = exp (w (t) — T9 
us (w° (t) + jt) 
It follows that: 
fax x) (y) = Firpo we) (x,y) Š 


= 1 Y Qz- y) (2 (2x — y)? 
= exp (w z” ) Ba " exp 7 

_ Qz-y) [2 12, Qz-) 

gui 4 2t 


5. To compute the density of Mx (t), we use the fact that it is the marginal!’ of 
Fx, x) G0 y): 


x 
faux (@) — / fax, x) Gy) dy 
— oo 
2 v (Qe — (2x y)? 
=y Al t V) opu- puèt- Gg dy 
-OQ 
v x 
2 era] -a f "E sy " 
mt E E 
of ee tut o ef "EN à 
Ti Vrt 
= 2 T ud 
= g TE ae -u f 2 uy- hurt (20) dy 
J2nt Vt 


Using the change of variable z = t~!/? (y — ut — 2x), we have: 


" —z-—pgt 
ji 2 ony hurt erg? dy = 2 | ve 1 g2us- i2 dy 
-œ Y Ti =% VOT 


Il 


| 
N 
m 
N 
T 
8 
e 
Lom 
| 
8 
scl 
Xe 
c 
Ss 


Finally, we obtain: 


?Using notations used to state the Girsanov theorem, we have g (t) = p. 
10We have: 


d 1,2, Qe-y)? 1,2, (Qa-y)? Qa — 1,2, (Qa-y)? 
—— eh BH qu = ge V- 3H t— ae 4 Q9 — 9 uuu- da t- 


dy 
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6. We verify that: 
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OF(t) _ OPr{Mx (t) < x} 
Ou da 
= Se) a) 
-aucs ( LE 
m EL (=) aure (7524) 
= jfwuy(x) 


Moreover, we have F (0) = 1 and F (co) = 1. We conclude that F (x) is the probability 


distribution of Mx (t). 


A.4.13 Moments of the Cox-Ingersoll- Ross process 


1. We recall that: 


We deduce that: 


2. We recall that: 


We deduce that: 


var (X (t)) 


3. We recall that: 


We deduce that: 


[Y (4, 0] 2 v6 


1 /4ab cu 
z FX + C£oe 
(1— e7) 0? /4ab 4a ae 
e 
4a c? ' (1— e-at) 2? 


zoe ^ +b (1 — e*t) 


m (5 + inet) 
(1— eat)? o* ( 4ab 8a us 
E zoe ^ 
8a? c? (1— e7) o? 
b 1 
m j= —at\2 2 —at 
(euet (>+ (1— e-at)a pe ) 
E 
m o? xo CR un ca?b Dos) 
a a 2 
nn (Y 0.0) =v +36) ,f/— 
cal (v 4-20? 
4 8 


i (X (t) 


ab 
c 


= t 3cxoe | 


a (3age ** + b(1— e-95)) 


“i 


dabo-? + 2exge-2t)? 


2(1— e-at) 


Ja 


l 


(21967*t + b (1 — e-at))? 
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The excess kurtosis coefficients of Y (v,¢) is equal to: 


| 12(v +42) 
72 (Y (^0) = (era 


It follows that: 
120? (4ab + Aco?xge-**) 
(4ab + 2co2z9e-t)? 
307 (1 — e^**) (4xge ^! + b(1— e-**)) 
a (zoet + b (1 — e-at))? 


(XH) = 


A.4.14 Probability density function of Heston and SABR models 
1. The Fokker-Planck equation is: 
OU (t,21,22) = —Oz, [uziU (t,21,22)| — Ov, |a (b — v2) U (t, 21, x2)] + 
58, [z2z2U (t, 21,22)] + 582, [o?x3U (t, £1, £2)]| + 
pus ak [ox 122U (t, £1, x2)] 


The first-order derivatives are: 


Os, [uziU (t, x ,£2)] = Ax105,U (t, 21,22) + WU (t, 21, 22) 
Ox, [a (b — x3) U (t,x1,2)) =  a(b— 23) Q,,U (t, £1, £2) — aU (t, 21, x2) 


The second-order derivatives are: 


87. [z123U (t, 21, 22)] E 212202 U (t,21,22) + 441 220,,U (t, £1, £2) + 
2x2U (t, 21,22) 
oF |o? 23U (t, 21, £2)| = o7020%,U (t, £1, £2) + 20?0,,U (t, £1, £2) 
and: 
ont [cx4z3U (t,x1,23)] = 04317302. vU (t, 24,22) + 


ozir, U (t, 21,22) + 
oX20,,U (t, 21,22) + oU (t, x1, x2) 


We deduce that: 


1 1 
U (£21,239) = 3 717207, U (t, 21,23) + 39 4305,U (t, 21, 22) F 


pox, t202, ,,U (t, £1, £2) + 
(2x2 + po — H) 3104,U (t, £1, £2) + 
(o? + poza — a (b — x2)) 3x U (t, £1, £2) + 


(a + £2 po — u) U (t, £1, £2) 
2. The Fokker-Planck equation is: 
1 1 
aU (t, £1, £2) = 395. ci^siu (521,72) + 595. [V^z3U (t, £1, £2)] + 


pa. 


21,72 


[yafng0 (t, Ti, 23) 
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The second-order derivatives are: 

OA DOES] E x P3292 ,U (t, £1, £2) + 
4b? r2 Ox, U (t, £1, £2) + 
28 (28 — 1) x29 ?32U (t, £1, £2) 

C [^ z2U (t, T1, £2)| — v 21302 U (t, £1, £2) + 

Av? 230,,U (t, 21,23) + 
29° U (t, £1, £2) 


and: 


Ds z5 [vaiu (t, £1, £2) = vri 202 QU (t, £1, £2) + 
va 130, U (t, 1,22) + 
Bvzi 220,,U (t, £1, £2) + 


2Bvrf tU (t, £1, £2) 


We deduce that: 


1 1 
OU (t, £1, £2) = 321 2305, U (t, 21, 22) + 5 2305,U (t, 21,22) + 


puti t30, sU (t, %1, £2) + 


2 (Bef te, + pv) 15250, U (t, £1, £2) + 
(2? + pBva!) Fas) 22044U (t, 21,22) + 
(3 (28 — 1) 3267732 + V? + 2pBv28— tea) U (t, 21, £2) 


When f is equal to 1, we obtain: 


1 1 
aU (t, z1, £2) = 3717307, U (t, 21,22) + 5" 2302, U (t, £1, £2) + 


pva12202, sU (t, 21,22) + 
2 (£2 + pv) £1£20r, U (t, £1, £2) + 
(21? + pvx2) £2ðr,U (t, £1, £2) + 


(a3 + V? + 2pvxe) U (t, £1, £2) 


3. We have reported the probability density function of Heston and SABR models in 
Figures A.5 and A.6. 


A.4.15 Discrete dynamic programming 


1. (a) We have five states s(k) € {1,1.5,2,2.5} and eight control values c(k) € 
(1,2,3,4,5,6, 7, 8). We deduce that: 


1.141 1.108 1.075 1.038 1.000 
1.121 1.343 1.563 1.782 2.000 
1.842 2.134 2.425 2.714 3.000 
2.759 3.071 3.382 3.691 4.000 
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FIGURE A.5: Probability density function of the Heston model 
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FIGURE A.6: Probability density function of the SABR model 
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Q9 h2 bh2 rn 
Q9 h2 bh2 rn 

Wr dw 
Q9 C32 b2 b2 


3 


For instance, we have J (2,2) = (J)3  — 2.134 because s3 = 2 and k = 2. 

(b) We deduce that 7 (1,1) = 1.141. 

(c) We notice that c^(1) = 1 if s(1) = 1, œ (1) 22 if s(1) = 1.5 or s(1) = 2, 
c(1—23ifs(1225,c(2—1ifs(1)) = 1, œ (2) = 2 if s(1) = 1 bad 
When k is small, the objective function is mainly explained by —$ (c (k) — s (k je 


(c 
Therefore, maximizing f (k,s(k),c(k)) implies that c* (k) ~ s (k). This is why 
c* (k) cannot be greater than or equal to 4. 


J(1 Sk) 


—s(k) = 3 
=== s(k) = 13 
m= s(k) = 22 


Sk 


FIGURE A.7: Values taken by J (k, s(k)) and c* (k) 


2. (a) We have represented 7 (k, s(k)) in the first panel in Figure A.7. 


(b) In the second panel, we have reported J (1, s (k)). The maximum is reached for 
s(k) — 39. 

(c) In the third panel, we have reported the optimal control c* (k) when s(k) is 
equal to 3, 13 and 22. We notice that c* (k) = s(k) when k = 1 and c* (k) = 25 
when k = 100. The case k = 1 has been explained in Question 1(c). In the case 
k — 100, we have: 


f (k,s(k),c(k)) ~ — In s (k) + Bc (k) + y's (k)e*im 50) 


EM 
s (k) 


Therefore, maximizing f (k, s (k) ,c(k)) implies that c* (k) = max cj = 25. 
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A.4.16 Matrix computation 
1. (a) We have A = QTQ* where: 
0.737 —0.553  —0.390 
Q= | 0.526 0.107 0.844 
0.425 0.826 —0.370 
et : 
1.761 0.000 0.000 
T= | 0.000 —0.143 0.000 
0.000 0.000 0.581 
(b) We obtain: 
3.693 1.617 1.682 
e^ = | 1.617 2.895 0.820 
1.682 0.820 1.887 
and: 
—0.371 + 0.9612 0.512 — 0.185% 0.989 — 1.4352 
In A= 0.512 — 0.185;  —0.251 + 0.0367 0.124 + 0.277% 
0.989 — 1.435% 0.124 + 0.277i —1.302 + 2.1457 
(c) We have: 
iA | 5—iA 
2 
and: m A 
ie '* — ie’ 
in A= 
sin 7 
Therefore, we can calculate cos A and sin A from the matrix exponential. We 
obtain: 
0.327  —0.406 —0.391 
cos A = | —0.406 0.554  —0.216 
—0.391 —0.216 0.756 
and: 
0.573 0.209 0.451 
sin A= | 0.209 0.661 0.036 
0.451 0.036 0.155 
We have: 


cos? A + sin? A = I3 


(d) For transcendental functions f (x), we have f (A) = 


yz, we obtain: 


0.836 -- 0.115: 
0.264 — 0.0222 
0.525 — 0.1737 


0.264 — 0.0222 
0.910 + 0.0042 
0.059 + 0.0337 


AV? = 


Qf (T) Q*. Using f (x) = 


0.525 — 0.1732 
0.059 + 0.0332 
0.344 + 0.2582 


2. The eigenvalues of © are —0.00038, 0.00866, 0.01612 and 0.05060. X is not a positive 


semi-definite matrix because one eigenvalue is negative. 


We have X = (A; + i43)? 


where: 
0.19378 0.04539 0.01365 | —0.01440 
Ses 0.04539 0.13513 —0.03221  —0.03445 
0.01365 | —0.03221 0.02740 | —0.02833 
—0.01440 —0.03445  —0.02833 0.08866 
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and: 


0.00024 
—0.00073 
—0.00184 
—0.00083 


Ay = 


We deduce that the nearest covariance matrix is: 


0.04000 

c 0.01499 
0.00196 
—0.00602 


M src 


3. We obtain o (B) = Cs (—25%). This is the lower bound of constant correlation matrix. 


More generally, we have: 


then the lower bound. 
4. We obtain: 


1.0000 
0.6933 
p(C) — | 0.6147 
0.2920 
0.7376 


—0.00073 
0.00224 
0.00563 
0.00255 


p (Cn (r)) = Cn (r7) 


where r* = max (r, —1/ (n — 1)). If r « —1/ (n — 1), the nearest correlation matrix is 


1.0000 
0.4571 
0.7853 
0.2025 


0.01499 
0.02254 
—0.00364 
—0.00745 


— 0.00083 


0.00255 
0.00642 
0.00291 


—0.00602 
—0.00745 
—0.00237 

0.01006 


1.0000 


